Phillip Rhodes · @mindcrime
43 followers · 172 posts · Server fosstodon.org

Working on multimodal embodied AI forces to you learn a lot, about a lot of stuff. Focusing on the "hearing" subsystem for right now, so having to spend a lot of time on signal processing stuff that I don't know much about. Learning everything from ALSA, JavaSound API, etc. through audio file formats (PCM, WAV, etc.), to spectral analysis, FFT, DFT, and so on. And that's just to front-end the work on actual speech recognition. Wheeew!!!

#BozBox #norestforthewicked

Last updated 3 years ago

Phillip Rhodes · @mindcrime
43 followers · 172 posts · Server fosstodon.org

One other angle I may pursue for is giving it the ability to sense the state of its onboard power supply (when not plugged into the wall) so it will have something on which I can base a notion of something analogous to the "hunger" that we humans experience.

Will that prove to be important, or even useful? No idea, but I'm leaning towards implementing it nonetheless.

#BozBox

Last updated 3 years ago

Phillip Rhodes · @mindcrime
43 followers · 172 posts · Server fosstodon.org

So far captures motion data using a 9DOF accelerometer/magnetometer/gyroscope board, receives coarser grained motion and location data from a GPS receiver, and now is just about to start capturing audio.

Vision will be next after that, and then some random things that I might or might not do: light sensors, barometric pressure, temperature, etc.

#BozBox

Last updated 3 years ago

Phillip Rhodes · @mindcrime
43 followers · 172 posts · Server fosstodon.org

So, what *is* this thing anyway, you might be asking.

Simply put, it's a "box of sensors" that I can carry around that will experience the physical world with those various sensors, and save multi-modal data about its experiences, so I can play with ideas about how an AI (or possibly a human) learns a "naive epistemology of the world", "intuitive physics of the world", language usage, naive epistemology, etc.

#BozBox

Last updated 3 years ago

Phillip Rhodes · @mindcrime
17 followers · 40 posts · Server fosstodon.org

Making progress on the project, in terms of receiving audio input. Starting to understand how works. That's something I *never* thought I'd say!

The tricky part is working with two microphones so we can have stereo hearing, analogous to human hearing.

The second tricky part is getting the sound into code. Use JNI? Or maybe use plain old arecord and a shared memory-mapped file? Or "other"? Hmm...


#BozBox #alsa #java #linux #ai

Last updated 3 years ago

Phillip Rhodes · @mindcrime
17 followers · 40 posts · Server fosstodon.org

What's going on today? Got about 8 miles in on the mountain bike at Carolina North Forest, now sitting at Barnes & Noble in Durham working on our "box of sensors" project for experimenting with .

#BozBox #EmbodiedAI

Last updated 3 years ago