Closed

Voice pitch tracking and recording app.

I need visual voice pitch tracking and recording app.

This is for evaluation purpose, i need just working prototype, so most of settings can be done with constants for now

App should do at the same time:

- play mp3/wav (from memory stream io)

- scroll smoothly long-wide image

- track accurate pitch (need to select one of existing free libs or ready to use code)

- draw pitch on that image (draw always at static position on screen and scroll image)

- record into memory stream object in wav format, adjust delay to be synced with playable file

After playing/recording it should:

- allow to scroll by mouse/touchpad generated image (original + tracking) to check pitch hittness visually and need to make some scrollbar

- allow to save recorded wav file

- restart again by button

Will be 2 modes. main and loop mode.

Loop mode differense: it should allow make a loop of range (in float seconds) with auto restart (clear generated image, clear recorded io stream, then pause for defined amount of seconds (or wait for key press, 2 options), and start from loop starting position). Stop by button, recorded stream should store only last try from loop start in memory and allow to save it by button.

If the loop range fit into screen - no scroll, just move pitch pointer over screen.

There are many ready to use pitch tracking libs, need to research which of them can do realtime accurate pitch detection. The complexity in that libs not adapted for realtime use, need adapt them. Just FFT not work here, just detecting note its not that i need.

I've tested 2 libs for python with neural network and with pyin implementation, both detecting pitch very well, but not suitable for realtime.

i also know one lib for C++ which detect reasonably and can work in realtime.

Scrolling and pitch tracking should look like in vocaberry app on android/ios

In future or if budget will allow to do in that project:

- after recording it should allow to playback mixed source mp3 with recorded one. at least should be taken into account while developing code now.

- it should allow to playback while recording 2 mixed mp3/wav with different volume at the same time.

So it should be taken into account in developing.

Requirements:

- use asio for play and recording, settings should be configured (for now just defined constants is fine 24bit, 48khz, read at once samples count should be read from asio device default/current settings). portaudio can be used.

- use c/c++ with qt, delphi, possibly python+qt, not c#

- scrolling of image should be smooth, so need to use gl or directx or something

- scrollable image should be synced with samples count played (not just timer)

- all should not be much cpu intensive, possibly use thread for pitch lib

- definable volume level under which it will not track pitch (no draw on image)

Image will have header and footer size (definable), inner part splitted equally to definable notes range (frequency ranges). Notes specified as [CDEFGAB][1-6][#]. So with image will be image definition file with parameters:

- notes range, example e1-a2

- header size

- footer size

For debugging purpose asio4all may be used and some cheap microphone.

Also before starting or even bidding please try vocaberry to understand how it should work.

Tracking lib should be pretty precise (at least like in vocaberry) and delay minimized as much as possible. Also sometimes vocaberry mistakes for octave for few milliseconds, so need to somehow workaround this issue (sometimes frequency peak +-1 octave have more power for small amount of time).

In future in case of success (it will be tested on some sample of peoples) i need to port it to linux (jackd+qt/gl), so code should be ready to port, no need to do cross-platform for now, but better if it will use qt or gl

I will pay only after task is fully done and tested by me, i will not pay for parts. Budget for this prototype up to 500$, if you think that its not possible to fit into budget - just skip.

Skills: Qt, C Programming, Delphi, C++ Programming, OpenGL

See more: recording machine greetings voice sample, recording movie trailer voice, jmf voice pitch, voice changer asterisk recording, voice recording simple script voice mail greeting, voice actors ivr recording, recording sounds male voice sound female, voice sample audio recording, iphone apps repeat voice pitch, kindle app text voice, iphone app change voice real time, voice pitch asterisk, app changes voice, mobile app changes voice, amazon app text voice, android app video voice, voice changer app female voice, voice pitch changer app, voice pitch app

About the Employer:
( 0 reviews ) Kirov, Russian Federation

Project ID: #31804647

5 freelancers are bidding on average $620 for this job

hsh564cf84accd96

We will do your C++ work I am writing this proposal in order to work for you in Software and Web Development. We are highly trained professional developers seeking to freelance and earn online. Having a flair in progr More

$500 USD in 7 days
(31 Reviews)
5.0
(3 Reviews)
3.1
andrewdai11191

Hello Thanks for your job posting I read your job description carefully I am an Python, QT, C++ expert I have a lots of experience in Voice processing If you work with me, you will get best good results Please give me More

$500 USD in 15 days
(0 Reviews)
0.0
branthanmar

Thanks for the detailed info. sounds really interesting. I am a python programmer with seven years experience Great expertise in digital signal processing in python and matlab. We will do the code. Thank you

$350 USD in 4 days
(0 Reviews)
0.0
ChuckkH

What you are asking for is extremely complex. Of course, just applying a pitch-tracking library is not so complicated, but matching it all up to the interfaces you demand would take a good amount of time. I wouldn't tr More

$1499 USD in 30 days
(0 Reviews)
0.0