Friday, February 22, 2013

sound files + bleh consonants

Made a matrix, downloaded some sound files (sources below). Pengfei's matlab code now saves graphs automatically because I got too lazy to save them by hand C: (tehe I love matlab)

http://beta.freesound.org/people/janmario/downloaded_packs/
http://www.phonetics.ucla.edu/course/chapter1/chapter1.html

I think I need to record my own sound files. These sound files I downloaded have too much vowels in them. The image below is the 'p' sound (as in "lip"). The sound file I downloaded from online is really "pa". This makes sense because a consonant is a-periodic, and thus you can't really pronounce the consonant without the vowel. What this means in terms of the graph below, is that the "p" sound is really the vertical blue lines at the beginning and the "a" is the red part. I don't think pengfei's code (as of right now) can deal with the level of detail consonants need to be evaluated at.




At some point, (after figuring out the consonants), we need to catagorize different consonants so we can create a HCA tree.


And because these images are so pretty C:

Above: the vowel in "hot"
Below: the 'm' in "am"


And again, the "hot" sound is fine because it is a vowel. the "m" sound is really "ma" and you can see what we need to take out is just the blue stripe in the beginning. The rest of the data is really an unnecessary "a" sound.


Actually looking at all these graphs. It reminds me of this art project one of my teachers showed me. Spoken word is actually really werid and chaotic. Breaking speech down into phoneme works to a certain extent, but it is actually a pretty bad way of reproducing speech. The artist in the video recorded himself speaking all the different phonemes, and tried to speak with by pasting the phonemes together. (You can clearely see that it doesn't work well).



No comments:

Post a Comment