Skip to content

praveenVnktsh/soundsense

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Directory structure:

models
    baselines
        cnnlstm
            model.py
            ...
        mulsa
            model.py
            ...
test
    test_*.py
model_inference.py        

Keyboard teleop

w - move up s - move down a - move left d - move right

i - extend arm j - retract arm

l - roll right j - roll left

m - close gripper n - open gripper

Instructions

You will need 3 terminals: Terminal 1 - roslaunch stretch_core stretch_driver.launch (background terminal)

Split terminal into two panels: terminal 2 - grab_audio.py

terminal 3 - teleop_collector.py (control on this)

After collecting one data point, ensure that the audio recorded is correct (play music near the camera and see if it is correct). Once everything looks okay, you can start collecting all the datapoints.

Audio README

roslaunch audio_capture capture.launch format:="wave"

Audio is of format - S16LE (signed 16 bit (int16) little endian)

  • Before running inference, run pavucontrol and ensure that the recording device is the stereo osmo action 3 camera.
  • Also ensure that the input device in settings is osmo action 3 camera. Output is internal headphones.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •