Selectable Mode Vocoder
==Technical specification==
===Codecs===
The SMV for [[Wideband]] [[code-division multiple access|CDMA]] is based on four codecs: full rate at 8.5 kbit/s, half rate at 4 kbit/s, quarter rate at 2 kbit/s, and eighth rate at 800 bit/s.<ref name="3gpp2 smv"/> The full-rate and half-rate codecs are based on a [[Code Excited Linear Prediction|CELP]] [[algorithm]]<ref name="3gpp2 smv"/> that uses a combined closed-loop and open-loop analysis (COLA). In SMV, each signal frame is first classified as one of the following:
* Silence/Background noise
* Non-stationary unvoiced
* Stationary unvoiced
* Onset
* Non-stationary voiced
* Stationary voiced

===Algorithm===
The algorithm includes [[voice activity detection]] (VAD) followed by an elaborate [[Frame (Artificial intelligence)|frame]] classification scheme. Silence/background noise and stationary unvoiced frames are represented by [[frequency spectrum|spectrum]]-[[modulated]] noise and coded at quarter or eighth rate. The SMV uses four subframes for full rate and two or three subframes for half rate. The stochastic (fixed) codebook structure is also elaborate and uses sub-codebooks, each tuned for a particular type of speech. The sub-codebooks have different degrees of pulse sparseness (sparser for noise-like excitation). SMV achieves a [[Mean Opinion Score|MOS]] of up to 3.6<ref name="nokiamos">{{Cite web|url=http://europe.nokia.com/library/files/docs/Makinen2.pdf|title=Performance Comparison of Source Controlled GSM AMR and SMV Vocoders|access-date=2009-05-26|publisher=Nokia Research Center, Multimedia Technologies Laboratory|author1=J. Makinen|author2=P. Ojala|author3=H. Toukomaa|format=PDF}}{{dead link|date=May 2018 |bot=InternetArchiveBot |fix-attempted=yes }}</ref> at full rate with clean speech.

The coder works on a frame of 160 speech samples (20 ms) and requires a look-ahead of 80 samples (10 ms) if noise-suppression option B is used. An additional 24 samples (3 ms) of look-ahead are required if noise-suppression option A is used. The algorithmic delay of the coder is therefore 30 ms with noise-suppression option B and 33 ms with noise-suppression option A.

The next evolution of CDMA speech codecs is [[VMR-WB]], which provides much higher speech quality with [[wideband]] audio while fitting the same networks. SMV can also be used in the 3GPP2 container file format, [[3GP|3G2]].
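
The frame and delay figures above can be checked with a little arithmetic. The following Python sketch is illustrative only and is not taken from the SMV specification. The 8 kHz sampling rate is derived from the stated 160 samples per 20 ms frame, and the bits-per-frame values are derived from the stated bit rates rather than quoted from a source. All function and constant names are hypothetical.

```python
# Illustrative arithmetic only; names are hypothetical, not from the SMV spec.

FRAME_SAMPLES = 160          # samples per frame (from the text)
FRAME_MS = 20                # frame duration in ms (from the text)
LOOKAHEAD_SAMPLES = 80       # look-ahead with noise-suppression option B
OPTION_A_EXTRA_SAMPLES = 24  # additional look-ahead with option A

# Derived assumption: 160 samples in 20 ms implies 8000 samples per second.
SAMPLE_RATE_HZ = FRAME_SAMPLES * 1000 // FRAME_MS

# Bit rates of the four codecs, in bit/s (from the text).
RATES_BPS = {"full": 8500, "half": 4000, "quarter": 2000, "eighth": 800}


def samples_to_ms(samples: int) -> float:
    """Convert a sample count to milliseconds at the derived sampling rate."""
    return samples * 1000 / SAMPLE_RATE_HZ


def algorithmic_delay_ms(option: str) -> float:
    """Frame length plus look-ahead for noise-suppression option 'A' or 'B'."""
    if option not in ("A", "B"):
        raise ValueError("option must be 'A' or 'B'")
    total = FRAME_SAMPLES + LOOKAHEAD_SAMPLES
    if option == "A":
        total += OPTION_A_EXTRA_SAMPLES
    return samples_to_ms(total)


def bits_per_frame(rate: str) -> int:
    """Bits available in one 20 ms frame at the given codec rate."""
    return RATES_BPS[rate] * FRAME_MS // 1000


if __name__ == "__main__":
    for option in ("B", "A"):
        print(f"option {option}: {algorithmic_delay_ms(option)} ms")
    for rate in RATES_BPS:
        print(f"{rate} rate: {bits_per_frame(rate)} bits per frame")
```

Under these assumptions the sketch reproduces the 30 ms and 33 ms delays stated above, and gives 170, 80, 40 and 16 bits per 20 ms frame for the full, half, quarter and eighth rates respectively.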