faz - Stardance

5h 17m 15s logged

i did a ton of work in the past day or two. unfortunately i forgot to write a devlog, so this one covers over 5 hrs of working time.

i made working circuits for a ton of different components: LED bar, PWM with LED, RGB LED, neopixel (8 LEDs in one component), active + passive buzzer, touch sensing (with a loose wire as a sensor), potentiometers, photoresistors, and thermistors.

started experimenting with more complex circuit designs for subparts of the final project, like using multiple potentiometers to create an RGB LED circuit that can change to any colour by twisting the potentiometers (image attached).
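a minimal sketch of the pot-to-colour mapping behind a circuit like this: each potentiometer reading gets scaled to a PWM duty value for one colour channel. the ranges are assumptions, not taken from the post: a 12-bit ADC reading (0 to 4095, as on an ESP32) and a 10-bit duty range (0 to 1023). the names `adc_to_duty` and `pots_to_rgb` are hypothetical, and the hardware reads and writes are left out so the logic runs anywhere.

```python
ADC_MAX = 4095   # assumption: 12-bit ADC reading, as on an ESP32
DUTY_MAX = 1023  # assumption: 10-bit PWM duty range


def adc_to_duty(raw, adc_max=ADC_MAX, duty_max=DUTY_MAX):
    """Scale one potentiometer reading to a PWM duty value."""
    raw = max(0, min(adc_max, raw))  # clamp out-of-range readings
    return raw * duty_max // adc_max


def pots_to_rgb(r_raw, g_raw, b_raw):
    """Map three potentiometer readings to (red, green, blue) duty values."""
    return tuple(adc_to_duty(v) for v in (r_raw, g_raw, b_raw))
```

on a real board the three inputs would come from ADC reads and the three outputs would go to the PWM channels driving the LED pins. if the board uses a different ADC or duty resolution, only the two constants need to change.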

the photoresistor + LED circuit was interesting, as the light from the LED affected the photoresistor reading, so i would need to shield the photoresistor from it.

the buzzers allowed me to get some proper sound out (like a sine wave sound), so i'll continue experimenting with that later down the road.


@faz on hardware project · 21 days ago

1h 32m 33s logged

LED + Button

spent some time connecting esp32 and installing the driver + micropython.

designed some circuits on TinkerCAD with arduino instead of esp32

got used to some circuits, like a momentary push button and a non-momentary one, to light up an LED. wrote code for all of them.
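a rough sketch of the difference between the two behaviours, assuming the non-momentary version is done in code with a normal push button rather than with a latching switch (the post doesn't say which). `momentary` and `ToggleButton` are hypothetical names, and `pressed` stands in for whatever the button pin read returns.

```python
def momentary(pressed):
    """Momentary behaviour: the LED is on only while the button is held."""
    return pressed


class ToggleButton:
    """Non-momentary behaviour: each new press flips the LED state."""

    def __init__(self):
        self.led_on = False
        self._was_pressed = False

    def update(self, pressed):
        # flip only on the released -> pressed transition, so holding
        # the button down does not keep toggling the LED
        if pressed and not self._was_pressed:
            self.led_on = not self.led_on
        self._was_pressed = pressed
        return self.led_on
```

calling `update` once per loop iteration with the current button reading gives the latched LED state. a real button would probably also need debouncing, which this sketch skips.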

see below for the latest one i did


@faz on speech to text · 22 days ago

1h 1m 20s logged

Devlog 01

started my speech-to-text project today

got microphone input working with SoundDevice and set it up so it constantly listens in small chunks. it uses RMS volume to figure out whether I’m speaking or not, and then stores only the audio that actually contains speech
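a minimal sketch of the RMS check, written with only the standard library so it runs without audio hardware. it assumes each chunk is a flat sequence of float samples in the range -1.0 to 1.0. `SPEECH_THRESHOLD` is a made-up value, not the one used in the project, and would need tuning against the microphone's noise floor.

```python
import math

SPEECH_THRESHOLD = 0.02  # hypothetical value; tune for your microphone


def rms(chunk):
    """Root-mean-square level of one chunk of float samples."""
    if len(chunk) == 0:
        return 0.0
    return math.sqrt(sum(s * s for s in chunk) / len(chunk))


def is_speech(chunk, threshold=SPEECH_THRESHOLD):
    """Treat a chunk as speech when its RMS level exceeds the threshold."""
    return rms(chunk) > threshold
```

with NumPy arrays (which is what sounddevice streams hand over) the same level can be computed as `np.sqrt(np.mean(chunk ** 2))`.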

i also added silence detection so when I stop talking for about half a second, it assumes I’ve finished a sentence and sends the audio off to Faster-Whisper for transcription. transcription runs in a separate thread so the microphone can keep listening while Whisper does its thing
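the silence detection described above can be sketched as a small state machine: keep the chunks that contain speech, count consecutive quiet chunks, and hand over the buffered audio once the quiet run reaches about half a second. the class name, the 0.1 s chunk length, and the `speaking` flag (the result of whatever volume check is in use) are assumptions for illustration.

```python
class UtteranceSegmenter:
    """Collect speech chunks and release them after a run of silence."""

    def __init__(self, chunk_seconds=0.1, silence_seconds=0.5):
        # number of consecutive quiet chunks that ends an utterance
        self.max_quiet = max(1, round(silence_seconds / chunk_seconds))
        self.buffer = []
        self.quiet = 0

    def feed(self, chunk, speaking):
        """Return the finished utterance (a list of chunks) or None."""
        if speaking:
            self.buffer.append(chunk)
            self.quiet = 0
            return None
        if not self.buffer:
            return None  # silence before any speech: nothing to send
        self.quiet += 1
        if self.quiet < self.max_quiet:
            return None  # short pause: keep waiting
        utterance, self.buffer, self.quiet = self.buffer, [], 0
        return utterance
```

whenever `feed` returns a list, that list is the audio to pass on for transcription. resetting the buffer in the same step as returning it is one way to avoid sending the same audio twice.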

had to mess around with locks and a few state variables to stop duplicate transcriptions from happening, but it’s working pretty well now. right now it can listen, detect when I’m speaking, wait for me to finish, and then print the transcribed text automatically.

still needs some tweaking though. the silence detection isn’t perfect and I’m creating a lot of threads right now, which probably isn’t the best approach long term
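one possible way to cut down on thread creation, sketched under assumptions: a single long-lived worker thread that pulls finished utterances off a `queue.Queue`, instead of a new thread per transcription. `start_worker` is a hypothetical helper, and `transcribe` is a stand-in for the actual Faster-Whisper call, which isn't shown here.

```python
import queue
import threading


def start_worker(transcribe, on_text):
    """Start one thread that transcribes queued utterances in order."""
    jobs = queue.Queue()

    def run():
        while True:
            audio = jobs.get()  # blocks until an utterance arrives
            if audio is None:   # None is the shutdown signal
                break
            on_text(transcribe(audio))

    thread = threading.Thread(target=run, daemon=True)
    thread.start()
    return jobs, thread
```

the microphone loop would call `jobs.put(audio)` for each finished utterance and `jobs.put(None)` on exit. since only one thread ever transcribes and each utterance is taken off the queue exactly once, this may also remove some of the need for locks around duplicate transcriptions.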

start of the code is in the photo, i’m at like 102 lines