Talk:Waveform analysis/Doh ray me

From Rosetta Code

Explanation

Solfa Trigraphs? Could you add more explanation/links to help those not from that audio processing background? Thanks.--Paddy3118 13:46, 12 July 2011 (UTC)

http://en.wikipedia.org/wiki/Tonic_sol-fa
These have used digraph spelling variant with a single reh. (do, reh, me, fa, so, la, te, do), but I don't suppose that matters. You must remember Julie Andrews: Doe - a deer a female deer, Ray - a drop of golden sun, etc :) Markhobley 14:03, 12 July 2011 (UTC)

Average pitch or average frequency?

I think for this task we could use either average pitch or average frequency to achieve this, although I am fairly open minded as to what methods should be used here.

I think we first need to examine the waveform to determine if it is fairly even. (Hopefully, it will just be a "lah" sound from the microphone.

Maybe find the range of the frequencies used, and hopefully that will fall into one of our solfa bands. If not, then it might be a just a case of deciding between two notes. Markhobley 10:55, 13 July 2011 (UTC)

If the given waveform is fairly even, then I am thinking that we could take the modal result of the frequencies used throughout the given waveform and use the mathematical mode for our average, or if the results do not produce a mode, then we take a median.

This would really be the modal result of bands of frequencies within the waveform. Markhobley 10:55, 13 July 2011 (UTC)

Others have suggested Fast Fourier Transformation and Enhanced Autocorrelation functions to determine the pitch.

Ideally we want an algorithm that can make the decision reasonably quickly, so that it can be used in sing along type applications, where temporal media is used to provide the waveform, and the results are displayed on the screen as notes are sung into the microphone.

Markhobley 18:03, 12 July 2011 (UTC)