Showing posts with the label pcm. Show all posts

Monday, January 15, 2024

speech segmentation speech segmenting algorithm

https://www.researchgate.net/publication/221258872_a_simple_but_effective_approach_to_speaker_tracking_in_broadcast_news

https://www.researchgate.net/figure/automatic-segmentation-of-the-audio-recorded-in-the-cafeteria-noisy-environment-by_fig2_323155847

https://genekogan.com/works/field-rec-navigator/
Visualizing my field recordings
https://www.jyu.fi/hytk/fi/laitokset/mutku/en/research/materials/mirtoolbox
automatically segmented the raw recordings segmentation algorithm
https://sourceforge.net/projects/supercollider/
https://en.wikipedia.org/wiki/Principal_component_analysis
https://en.wikipedia.org/wiki/Music_information_retrieval

https://lgm.fri.uni-lj.si/research/segmentation-of-field-recordings/
Segmentation of field recordings — LGM

https://yaiglobal.com/index.php/component/k2/item/5-audio-segmentation

 https://www.researchgate.net/figure/audio-onset-segmentation-dashed-lines-variable-window-length-segmentation-empty_fig1_252187078


http://recherche.ircam.fr/equipes/temps-reel/audio-mosaicking/
Real-Time Audio Mosaicking
MFCC-based frame-by-frame audio mosaicking, using Mel-frequency cepstral coefficients (MFCC).

Semi-supervised feature selection for audio classification based on constraint compensated Laplacian score | EURASIP Journal on Audio, Speech, and Music Processing | Full Text
Figure 5: Accuracy as a function of the number of selected features
https://asmp-eurasipjournals.springeropen.com/articles/10.1186/s13636-016-0086-9



audio Packages DLL music Audio libraries library

 SEARCH Packages  Linux  Unix
https://slackbuilds.org/result/?search=audio&sv=15.0
https://pkgs.org/download/libopenshot-audio

https://en.wikipedia.org/wiki/Category:Audio_libraries
https://en.wikipedia.org/wiki/Category:Video_game_music_technology
The category is "Audio libraries". Pages in category "Audio libraries":

 BASS
 ClanLib
 DirectSound
 Enlightened Sound Daemon
 FMOD
 JACK Audio Connection Kit
 Libavcodec
 Miles Sound System
 Open Sound System
 OpenAL
 OpenSL ES
 PulseAudio
 Raylib
 Simple DirectMedia Layer
 UFMOD

Audio libraries Raylib ClanLib Libavcodec PulseAudio
https://packages.altlinux.org/en/p10/srpms/libopenshot-audio/
https://ru.wikipedia.org/wiki/%D0%9A%D0%B0%D1%82%D0%B5%D0%B3%D0%BE%D1%80%D0%B8%D1%8F:%D0%90%D1%83%D0%B4%D0%B8%D0%BE%D0%B1%D0%B8%D0%B1%D0%BB%D0%B8%D0%BE%D1%82%D0%B5%D0%BA%D0%B8

libopenshot-audio JUCE  audio  library

audio/opus-tools
https://en.wikipedia.org/wiki/Opus_(audio_format)
https://wiki.xiph.org/Opus-tools

libs For Qt5. pulseaudio  qt
https://archlinux.org/packages/extra/x86_64/pulseaudio-qt/
https://packages.fedoraproject.org/pkgs/pulseaudio-qt/pulseaudio-qt-qt5/

https://github.com/OpenShot/libopenshot-audio
OpenShot Audio Library (libopenshot-audio) is a free, open ...


https://www.haskell.org/
haskell Packages bindings audio haskell  bindings audio
https://hackage.haskell.org/package/htaglib
https://hackage.haskell.org/package/jack

 Octave packages audio Audio and MIDI Toolbox for GNU Octave. SHA256

https://en.wikipedia.org/wiki/UFMOD


spectrum-devel pkgs top-level/all-packages
https://spectrum-os.org/lists/archives/spectrum-devel/c22a62456db/s/?b=pkgs/top-level/all-packages.nix

https://discourse.julialang.org/t/package-trouble-again/104977


https://ctepp.calstate.edu/tlab-change-packages

Steam Audio supports the following platforms:

 https://valvesoftware.github.io/steam-audio/doc/capi/getting-started.html

Steam Broadcasting
Steam Support
https://help.steampowered.com › view
 
Steam Broadcasting is currently supported by the following browsers: Steam Client; Google Chrome (version 39+); Apple Safari (version 8+ on macOS); Internet ...

Modern audio compression for the internet. Opus audio format, Opus lossy audio coding format, Xiph.Org Foundation, standardized, code speech

 https://en.wikipedia.org/wiki/Opus_(audio_format)
Opus (audio format) - Wikipedia
Opus is a lossy audio coding format developed by the Xiph.Org Foundation and standardized by the Internet Engineering Task Force, designed to efficiently code speech and general audio in a singl

xiph/opus: Modern audio compression for the internet.

Opus Codec
https://opus-codec.org/



Opus Interactive Audio Codec
Overview

Opus is a totally open, royalty-free, highly versatile audio codec. Opus is unmatched for interactive speech and music transmission over the Internet, but is also intended for storage and streaming applications. It is standardized by the Internet Engineering Task Force (IETF) as RFC 6716 which incorporated technology from Skype’s SILK codec and Xiph.Org’s CELT codec.
Technology

Opus can handle a wide range of audio applications, including Voice over IP, videoconferencing, in-game chat, and even remote live music performances. It can scale from low bitrate narrowband speech to very high quality stereo music. Supported features are:

    Bitrates from 6 kb/s to 510 kb/s
    Sampling rates from 8 kHz (narrowband) to 48 kHz (fullband)
    Frame sizes from 2.5 ms to 60 ms
    Support for both constant bitrate (CBR) and variable bitrate (VBR)
    Audio bandwidth from narrowband to fullband
    Support for speech and music
    Support for mono and stereo
    Support for up to 255 channels (multistream frames)
    Dynamically adjustable bitrate, audio bandwidth, and frame size
    Good loss robustness and packet loss concealment (PLC)
    Floating point and fixed-point implementation
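
The ranges in the feature list above translate directly into buffer sizes. The following is a minimal sketch of that arithmetic only, not a call into libopus. The intermediate frame durations and sampling rates are the ones listed in RFC 6716 (the list above gives just the endpoints), and the function names are made up for illustration.

```python
# Frame durations (ms) and sampling rates (Hz) supported by Opus per RFC 6716.
FRAME_MS = (2.5, 5, 10, 20, 40, 60)
SAMPLE_RATES = (8000, 12000, 16000, 24000, 48000)


def samples_per_frame(sample_rate_hz: int, frame_ms: float) -> int:
    """Number of PCM samples per channel contained in one frame."""
    return int(sample_rate_hz * frame_ms / 1000)


def cbr_bytes_per_frame(bitrate_bps: int, frame_ms: float) -> int:
    """Approximate encoded size of one frame at constant bitrate (CBR)."""
    return int(bitrate_bps * frame_ms / 1000 / 8)


if __name__ == "__main__":
    for ms in FRAME_MS:
        print(ms, "ms at 48 kHz:", samples_per_frame(48000, ms), "samples")
    print("20 ms at 64 kb/s:", cbr_bytes_per_frame(64000, 20), "bytes")
```

With VBR the per-frame size varies, so the second function is only a sizing guide for the CBR case.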

Saturday, January 13, 2024

rtsp Streaming Media streaming library LIVE555 Media Server Proxy Server HLS Proxy vobStreamer streaming DVD RTP/RTCP/RTSP

https://en.wikipedia.org/wiki/Real-Time_Streaming_Protocol
LIVE555 Streaming Media
This code forms a set of C++ libraries for multimedia streaming, using open standard protocols (RTP/RTCP, RTSP, SIP). These libraries - which can be compiled for Unix (including Linux and Mac OS X), QNX (and other POSIX-compliant systems) - can be used to build streaming applications. The libraries are already being used to implement applications such as the "LIVE555 Media Server", "LIVE555 Proxy Server", and "LIVE555 HLS Proxy" and "vobStreamer" (for streaming DVD content using RTP/RTCP/RTSP). The libraries can also be used to stream, receive, and process MPEG, H.265, H.264, H.263+, DV or JPEG video, and several audio codecs. They can easily be extended to support additional (audio and/or video) codecs, and can also be used to build basic RTSP or SIP clients and servers, and have been used to add streaming support to existing media player applications, such as "VLC" and "MPlayer". (For some specific examples of how these libraries can be used, see the test programs below.)
https://girishjoshi.io/post/stream-a-video-over-rtsp-using-live555mediaserver/
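
Every stream these libraries send or receive over RTP carries the same 12-byte fixed header. As a rough illustration of what a client has to read before it reaches the codec payload, here is a sketch that parses that header as laid out in RFC 3550. It is plain Python and does not use LIVE555; the sample packet values (sequence number, timestamp, SSRC) are made up.

```python
import struct
from typing import NamedTuple


class RtpHeader(NamedTuple):
    version: int
    padding: bool
    extension: bool
    csrc_count: int
    marker: bool
    payload_type: int
    sequence: int
    timestamp: int
    ssrc: int


def parse_rtp_header(packet: bytes) -> RtpHeader:
    """Parse the 12-byte fixed RTP header (RFC 3550, section 5.1)."""
    if len(packet) < 12:
        raise ValueError("RTP packet shorter than the 12-byte fixed header")
    # Network byte order: two flag bytes, 16-bit sequence, 32-bit timestamp, 32-bit SSRC.
    b0, b1, seq, ts, ssrc = struct.unpack("!BBHII", packet[:12])
    return RtpHeader(
        version=b0 >> 6,
        padding=bool(b0 & 0x20),
        extension=bool(b0 & 0x10),
        csrc_count=b0 & 0x0F,
        marker=bool(b1 & 0x80),
        payload_type=b1 & 0x7F,
        sequence=seq,
        timestamp=ts,
        ssrc=ssrc,
    )


if __name__ == "__main__":
    # Hypothetical packet: version 2, payload type 0 (PCMU), 160 bytes of payload.
    sample = struct.pack("!BBHII", 0x80, 0x00, 1, 160, 0x12345678) + b"\x00" * 160
    print(parse_rtp_header(sample))
```

The payload starts after the fixed header plus 4 bytes per CSRC entry (and any header extension), which this sketch does not skip.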

Analysis of Streaming Server Characteristics - National Chiao Tung University / National Yang Ming Chiao Tung University Institutional Repository https://ir.nctu.edu.tw › bitstream (PDF), by 吳宗修, 2006: A streaming server consists of three important modules: "Set up a standard connection procedure", "RTSP Signaling Negotiation" and "packet ...

https://github.com/bluenviron/mediamtx
RTSP Pull - OvenMediaEngine
https://airensoft.gitbook.io/ovenmediaengine/live-source/rtsp-pull-beta
https://gstreamer.freedesktop.org/documentation/gst-rtsp-server/rtsp-server.html?gi-language=c
https://github.com/aler9/rtsp-simple-proxy

RTSP/RTP streaming support for MPlayer
http://www.live555.com/mplayer/
RTSP SDK Libraries for Windows, .NET 6+, .NET Framework, C#, VB, C/C++, and Python | LEADTOOLS
https://www.leadtools.com/sdk/multimedia/streaming/rtsp



Managed Media Aggregation using Rtsp and Rtp - CodeProject
https://www.codeproject.com/Articles/507218/Managed-Media-Aggregation-using-Rtsp-and-Rtp


RTSP & RTP Client, Broadcaster & Server Library | VASTreaming
https://www.vastreaming.net/rtsp-library.html
rtsp RTP  Streaming Media  streaming library

https://github.com/ekumenlabs/AndroidStreamingClient/blob/master/android_streaming_client/src/main/java/com/c77/androidstreamingclient/lib/rtp/RtpMediaDecoder.java
 android streaming client lib
 DataPacketTracer
 MediaExtractor
 RtpMediaDecoder
 RtpMediaExtractor
streaming data packet tracer decoder extractor Naishy/rtpsplit: RTP stream extractor GitHub - Linaro/OpenCSD: CoreSight trace stream decoder


rtp streaming data extractor
https://wiki.wireshark.org/rtp_statistics

speech audio processing coding enhancement audio library adpcm acelp pulse density


speech Audio codecs adpcm acelp  Audio codecs

Vocoders, Audio Codecs and Speech Compression Software
GAO Research
http://www.gaoresearch.com › products
ITU-T Vocoder Standards for Speech Processing Software and Audio Processing Codecs ; ITU-T G.723.1, 6.3 and 5.3 kbit/s, MP-MLQ, and ACELP based codec ; ITU-T G.

Comparison of audio coding formats - Wikipedia
https://en.wikipedia.org/wiki/Comparison_of_audio_coding_formats

TwoCC - MultimediaWiki
https://wiki.multimedia.cx/index.php/TwoCC
The TwoCC is the audio counterpart to the video FourCC. It is the audio format identifier used in the RIFF based multimedia formats by Microsoft (WAV and AVI). The TwoCC is 2 bytes long and stored in little endian format on disk. You can register your TwoCC with Microsoft but it seems that only some companies perform this process.
https://wiki.multimedia.cx/index.php/Category:Audio_Codecs
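
To make the TwoCC description concrete, here is a small sketch that reads the 2-byte little-endian format tag from the `fmt ` chunk of a WAV file. The tag values in the table are the commonly documented `wFormatTag` constants. `minimal_wav_header` is a hypothetical helper that builds a header-only file for the demo, and its other field values are placeholders.

```python
import struct

# A few well-known format tags (wFormatTag values).
TWOCC_NAMES = {
    0x0001: "PCM",
    0x0002: "Microsoft ADPCM",
    0x0006: "ITU-T G.711 A-law",
    0x0007: "ITU-T G.711 mu-law",
    0x0011: "IMA ADPCM",
}


def minimal_wav_header(format_tag: int) -> bytes:
    """Build a header-only RIFF/WAVE file (no audio data) for demonstration."""
    # wFormatTag, nChannels, nSamplesPerSec, nAvgBytesPerSec, nBlockAlign, wBitsPerSample
    fmt = struct.pack("<HHIIHH", format_tag, 1, 8000, 16000, 2, 16)
    body = b"WAVE" + b"fmt " + struct.pack("<I", len(fmt)) + fmt + b"data" + struct.pack("<I", 0)
    return b"RIFF" + struct.pack("<I", len(body)) + body


def read_twocc(wav_bytes: bytes) -> int:
    """Return the format tag stored at the start of the fmt chunk."""
    if wav_bytes[:4] != b"RIFF" or wav_bytes[8:12] != b"WAVE":
        raise ValueError("not a RIFF/WAVE file")
    pos = 12
    while pos + 8 <= len(wav_bytes):
        chunk_id, size = struct.unpack_from("<4sI", wav_bytes, pos)
        if chunk_id == b"fmt ":
            return struct.unpack_from("<H", wav_bytes, pos + 8)[0]
        pos += 8 + size + (size & 1)  # chunks are padded to an even length
    raise ValueError("no fmt chunk found")


if __name__ == "__main__":
    tag = read_twocc(minimal_wav_header(0x0011))
    print(hex(tag), TWOCC_NAMES.get(tag, "unknown"))
```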

G.7xx: Audio (Voice) Compression Protocols (CODEC) (PDF)
Transcoding of Voice Codecs G.711 to G.729 and ... 
PDF iTu T G.7xx Standards for Speech Codec

https://academic-accelerator.com/encyclopedia/zh/g-722
G.722: latest encyclopedia, news, reviews, and research

https://www.wikiwand.com/en/List_of_video_compression_formats
List of codecs - Wikiwand

Non-compression
    Linear pulse-code modulation (LPCM, generally only described as PCM) is the format for uncompressed audio in media files and it is also the standard for CD-DA; note that in computers, LPCM is usually stored in container formats such as WAV, AIFF, or AU, or as raw audio format, although not technically necessary.
        FFmpeg
    Pulse-density modulation (PDM)
        Direct Stream Digital (DSD) is standard for Super Audio CD
            foobar2000 Super Audio CD Decoder (based on MPEG-4 DST reference decoder)
            FFmpeg (based on dsd2pcm)
    Pulse-amplitude modulation (PAM)
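
The point about LPCM being the same samples whether raw or wrapped in a container can be shown with the Python standard library. This sketch generates a short 16-bit mono sine tone as raw LPCM and then wraps the identical bytes in a WAV container. The tone frequency, amplitude and function names are arbitrary choices for the illustration.

```python
import io
import math
import struct
import wave


def sine_lpcm16(freq_hz: float, seconds: float, rate_hz: int = 44100) -> bytes:
    """Return mono 16-bit little-endian LPCM samples of a half-scale sine tone."""
    n = int(seconds * rate_hz)
    samples = (
        int(32767 * 0.5 * math.sin(2 * math.pi * freq_hz * i / rate_hz))
        for i in range(n)
    )
    return b"".join(struct.pack("<h", s) for s in samples)


def wrap_in_wav(pcm: bytes, rate_hz: int = 44100, channels: int = 1) -> bytes:
    """Store raw 16-bit LPCM in a WAV container; the samples are not altered."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as w:
        w.setnchannels(channels)
        w.setsampwidth(2)  # 2 bytes = 16 bits per sample
        w.setframerate(rate_hz)
        w.writeframes(pcm)
    return buf.getvalue()


if __name__ == "__main__":
    raw = sine_lpcm16(440.0, 0.5)
    wav = wrap_in_wav(raw)
    print(len(raw), "bytes raw;", len(wav) - len(raw), "bytes added by the container")
    print("CD-DA bit rate:", 44100 * 2 * 16, "bit/s")
```

The last line is the usual CD-DA figure: 44.1 kHz, two channels, 16 bits per sample.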

Friday, January 12, 2024

gsm audio speech telecommunications technology Audio Compression , communications system voice codec VoIP speech pcm amr-wb opus SPEEX

 //////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////
bass library g729 g719 g722 G.726   Code-excited linear prediction

speech  telecommunications   technology Audio Compression

https://github.com/sippy/libg722

AES E-Library » Real-Time CELP Speech Coding in a Voice Response Environment
https://www.aes.org/e-lib/online/browse.cfm?elib=5530
CELP  speech Code-excited linear prediction

https://en.wikipedia.org/wiki/RTP_payload_formats
g729a acelp internet audio stream message rtp payload format for the g.729.1 audio codec rfc 4749
https://en.wikipedia.org/wiki/Category:Speech_codecs
https://en.wikipedia.org/wiki/CELT
https://en.wikipedia.org/wiki/G.729.1
https://en.wikipedia.org/wiki/Code-excited_linear_prediction
https://en.wikipedia.org/wiki/Speech_coding

https://github.com/sippy/libg722
https://github.com/wisekrakr/CommUniWise
https://github.com/wisekrakr/SIP_dev_pushToTalk
ITU G.722 Voice  message  SIP  RFC 3261 github

https://github.com/ttsou/openbts-p2.8/tree/master
https://datatracker.ietf.org/doc/rfc6366/

github  SIP AudioFrame
https://github.com/onmyway133/awesome-voip
https://github.com/sipsorcery-org/sipsorcery/issues/914
https://github.com/AGProjects/sipclients/blob/master/sip-audio-session
https://github.com/baresip/baresip/blob/main/test/call.c
https://github.com/pjsip/pjproject/blob/master/pjsip-apps/src/samples/siprtp.c




g729a acelp rtp g.729.1 rfc4749 pdf 
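8-32 kbit ... bitstream-compatible with JT-G729. The Telecommunication Technology Committee (TTC) https://www.ttc.or.jp › files › JT-G729.1v5.pdf (PDF), November 14, 2013: (2) TTC standard JT-G729.1 ... ITU-T Recommendation G.729.1 ... speech coding scheme using ACELP. (2) TTC standard JT-G729 Annex A. Low ...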

RTP Payload for DTMF Digits, Telephony Tones, and Telephony Signals. [RFC 4749] IETF RFC 4749 (2006), RTP Payload Format for the G.729.1 Audio

g729a acelp rtp g.729.1 rfc4749 pdf g.729.1 audio acelp android usacdec_acelp acelp audio stream frame

aaa android rtp audio stream frame mediacodec messenger stackoverflow mediacodec decode aac audio chunks from rtsp and play
https://stackoverflow.com/questions/48602108/mediacodec-decode-aac-audio-chunks-from-rtsp-and-play
https://developer.android.com/reference/android/media/MediaCodec.html
https://github.com/imansaleh16/Stack-Overflow-Tags-Communities/blob/master/dataset/E_llda
https://github.com/pedroSG94/RootEncoder/wiki

////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////
Network Protocols
The following network protocols are supported for audio and video playback:
    RTSP (RTP, SDP)
    HTTP/HTTPS progressive streaming
    HTTP/HTTPS live streaming draft protocol:
        MPEG-2 TS media files only
        Protocol version 3 (Android 4.0 and above)
        Protocol version 2 (Android 3.x)
        Not supported before Android 3.0

https://github.com/fyhertz/libstreaming/blob/master/src/net/majorkernelpanic/streaming/audio/AACStream.java

RTP aac android.media.MediaCodec RTP STREAM github wire
https://github.com/ekumenlabs/AndroidStreamingClient/blob/master/android_streaming_client/src/main/java/com/c77/androidstreamingclient/lib/rtp/RtpMediaDecoder.java
https://github.com/ekumenlabs/AndroidStreamingClient/tree/master

https://www.codeproject.com/Articles/797537/Making-an-Audio-Spectrum-analyzer-with-Bass-dll-Cs

https://delphi-lab.ucoz.ru/load/17-1-0-32

Audio Tools Library v.1.4
[ Download from server (95.6 Kb) ]     10.07.2008, 02:19
By J. Faul. ATL is a set of programming tools for manipulating several audio file formats. The pack includes the components described below:

    * MPEGaudio - for manipulating MPEG audio file information,
    * ID3v1 - for manipulating ID3v1 tags,
    * ID3v2 - for manipulating ID3v2 tags,
    * WAVfile - for extracting information from the WAV file header,
    * OggVorbis - for extracting information from the Ogg Vorbis file header,
    * MPEGplus - for manipulating MPEGplus file information,
    * TwinVQ - for extracting information from the TwinVQ file header,
    * Monkey - for manipulating Monkey's Audio file information.
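
As an illustration of what a component like ID3v1 has to do, here is a minimal sketch that reads an ID3v1 tag from the last 128 bytes of a file. It follows the standard ID3v1 layout ("TAG" marker, then fixed-width title, artist, album, year, comment and a genre byte). It is not taken from ATL, which is a Delphi library, and the function name is made up.

```python
import struct
from typing import Optional


def parse_id3v1(data: bytes) -> Optional[dict]:
    """Parse an ID3v1 tag from the last 128 bytes of a file, or return None."""
    if len(data) < 128 or data[-128:-125] != b"TAG":
        return None
    title, artist, album, year, comment, genre = struct.unpack(
        "<30s30s30s4s30sB", data[-125:]
    )

    def text(raw: bytes) -> str:
        # Fields are fixed-width and padded with NUL bytes (sometimes spaces).
        return raw.split(b"\x00", 1)[0].decode("latin-1").strip()

    return {
        "title": text(title),
        "artist": text(artist),
        "album": text(album),
        "year": text(year),
        "comment": text(comment),
        "genre": genre,  # numeric index into the ID3v1 genre list
    }


if __name__ == "__main__":
    # Hypothetical tag appended to two dummy bytes standing in for audio data.
    tag = (
        b"TAG"
        + b"Song".ljust(30, b"\x00")
        + b"Artist".ljust(30, b"\x00")
        + b"Album".ljust(30, b"\x00")
        + b"2008"
        + b"".ljust(30, b"\x00")
        + bytes([17])
    )
    print(parse_id3v1(b"\xff\xfb" + tag))
```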









Live555 RTSP Server on Android rtsp server on android
https://github.com/papan01/Live555-server-android
http://hank5000.github.io/blog/2015/06/24/live555-rtsp-server-on-android/

developer.android.com/reference/androidx/media3/exoplayer/rtsp/reader/rtpac3reader
https://developer.android.com/reference/kotlin/androidx/media3/exoplayer/rtsp/reader/RtpPcmReader
https://developer.android.com/reference/kotlin/androidx/media3/exoplayer/rtsp/reader/RtpAc3Reader
https://developer.android.com/media/media3/exoplayer/rtsp?hl=zh-tw
https://developer.android.com/reference/androidx/media3/exoplayer/rtsp/package-summary

Bass Audio Library https://github.com/topics/bass-dll bass.dll

 https://www.codeproject.com/Articles/2848/nBASS-A-sound-libary-for-NET
Un4seen Developments
https://www.un4seen.com/

BASS is an audio library for use in Win32, MacOS, Linux and PocketPC software. Its purpose is to provide the most powerful and efficient (yet easy to use) sample, stream, MOD music, and recording functions. This library was written by Ian Luck at Un4seen Developments. New features include an add-on plugin system, MOD position and syncing in bytes, support for AIFF files, floating-point sampling, more options, and more. The BASS audio library is used in MediaPortal for the default BASS audio player.

https://github.com/topics/bass-library
https://en.wikipedia.org/wiki/Bass
 https://en.wikipedia.org/wiki/AIMP

BASS audio library v2.4 PureBasic 4.20 includes. - PureBasic Forums - English
 

https://github.com/ans-hub/audio_out

https://www.team-mediaportal.com/wiki/display/glossary/BASS+Audio+Library
http://bass.radio42.com/

bass.dll play delphi
https://itecnote.com/tecnote/delphi-load-bass-dll-and-play-mp3/

https://github.com/Zaflis/nxpascal/tree/master
https://github.com/Zaflis/nxpascal/blob/master/src/Bass.pas
Delphi, C++, VB - BASS Audio Recognition Library
https://www.3delite.hu/Object%20Pascal%20Developer%20Resources/bassaudiorecognitionlibrary.html

Bass.BASS_ChannelPlay Method
https://github.com/ManagedBass/ManagedBass/blob/master/src/AddOns/BassDShow/BassDShow.cs
Un4seen Developments
https://github.com/DragonMinded/xmplay
https://github.com/DragonMinded/libnaomi
http://support.xmplay.com/
http://bass.radio42.com/help/html/743b046b-0c42-71a0-b613-799f5f0450b9.htm
http://docwiki.embarcadero.com/RADStudio/Athens/en/Libraries_and_Packages_(Delphi)
https://delphimagic.blogspot.com/2013/05/escuchar-la-radio-por-streaming.html
https://stackoverflow.com/questions/8964488/delphi-load-bass-dll-and-play-mp3

BASS.DLL GITHUB
https://github.com/ManagedBass/ManagedBass/tree/master/src/Bass/Shared/Bass

Radio streaming with bass.dll 
https://autoit.de/thread/25624-radio-streaming-with-bass-dll/

IPHLPAPI.dll  IPHLPAPI  ip helper api Iphlpapi.h header  
WSOCK32.dll
wasapi Windows Audio Session API
bass.dll basswasapi.dll

https://superblt.znix.xyz/
https://superblt.znix.xyz/doc/xaudio/
XML Tweaker - SuperBLT
SuperBLT cross-platform audio API


bass.dll  script plugin header
https://wiki.mairlist.com/faq:bass-plugins
autohotkey
powerbasic bass.dll  script plugin header powerbasic Sound BASS.DLL API encapsulation
foobar components
powerbasic third-party-addons
sound-bass-dll-api-header-file

cwmp-data-models: CPE WAN Management Protocol (CWMP) and device data models; network management protocol data model definitions, cwmp-data-models

Saturday, January 6, 2024

DY-SV17F W0974 dy1703A flash 32Mbit mp3 music player memory Winbond's W25X and W25Q SpiFlash® Multi-I/O Memories feature the popular Serial Peripheral Interface (SPI), densities

DY-SV17F Audio Module Mini MP3 Player IO Trigger USB Download Flash Voice Module; Supports Recording Function: Yes; Display Size: None; Package: Yes; Mode



The DY-SV17F voice module integrates IO segment trigger, UART serial port control, ONE_line single-bus serial control, standard MP3 and other modes, for 7 working modes in total. The onboard 5W Class D power amplifier can directly drive a 4Ω, 3~5W speaker. The module decodes MP3 and WAV, and the onboard 32Mbit (4MByte) flash stores the audio files, which can be updated from a computer over a USB data cable.

Supports MP3 and WAV decoding formats.
Supported sampling rates (kHz): 8/11.025/12/16/22.05/24/32/44.1/48.
24-bit DAC output, with 90dB dynamic range and 85dB signal-to-noise ratio.
Onboard 32Mbit (4MByte) flash storage; audio files can be updated from a computer over a USB data cable.
Built-in 5W Class D power amplifier that can directly drive 4Ω, 3~5W speakers.
UART serial port control of voice playback: play, pause, track selection, volume up and down, etc.; up to 65,535 tracks can be selected; the baud rate is 9600 bps.
Supports IO-triggered playback: 8 IO ports trigger 8 tracks, or 8 IO ports in combination trigger 255 tracks.
Supports One_line single-bus serial control: play, pause, track selection, volume up and down, and other functions.
Supports 3 configuration IO pins for selecting among up to 7 working modes.
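
The figure of 255 tracks from 8 IO ports is consistent with treating the eight lines as one binary number: 2^8 = 256 combinations, minus the idle state where no line is asserted. Likewise, 65,535 is 2^16 - 1, the largest non-zero 16-bit track number. The sketch below decodes such a combination. It assumes the lines are active-low (they idle high) and that IO0 is the least significant bit; both are assumptions, not taken from the module's datasheet, and the function name is hypothetical.

```python
from typing import Optional, Sequence


def track_from_io_levels(levels: Sequence[int]) -> Optional[int]:
    """Map eight IO levels (IO0..IO7; 1 = high/idle, 0 = pulled low) to a track number.

    Assumptions: active-low inputs, IO0 is the least significant bit.
    Returns None when every line is high, i.e. nothing is triggered.
    """
    if len(levels) != 8:
        raise ValueError("expected 8 IO levels")
    code = 0
    for bit, level in enumerate(levels):
        if level == 0:
            code |= 1 << bit
    return code or None


if __name__ == "__main__":
    print(track_from_io_levels([1, 1, 1, 1, 1, 1, 1, 1]))  # idle
    print(track_from_io_levels([0, 1, 1, 1, 1, 1, 1, 1]))  # only IO0 low
    print(track_from_io_levels([0, 0, 0, 0, 0, 0, 0, 0]))  # all lines low
```

Check the module's own documentation for the real bit order and polarity before wiring anything to it.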

Notice:
1. In "key combination playback", IO0-IO7 return to the original high level after outputting the corresponding level, like a single key press; in "level combination playback", IO0-IO7 hold the corresponding level.
3. The difference between "I/O combination (independent) mode 0" and "I/O combination (independent) mode 1" is that mode 0 keeps playing the current track to the end after the level is released, while mode 1 stops playing the track immediately after the level is released.

Package Includes: DY-SV17F MP3 Voice Module

25Q32   SPI NOR Flash
BY25Q32BS 32M-bit Serial Peripheral Interface(SPI) Flash memory

https://www.winbond.com/hq/product/code-storage-flash-memory/serial-nor-flash/?__locale=en

Winbond's W25X and W25Q SpiFlash® Multi-I/O Memories feature the popular Serial Peripheral Interface (SPI), densities from 512K-bit to 512M-bit, small erasable ...

25Q16  Serial  SPI Serial NOR flash  Memory FLASH
MACRONIX MX25V1635FZNI
Flash Memory, Serial NOR, 16 Mbit, 2M x 8bit, SPI, WSON, 8 Pins

https://forum.openwrt.org/t/two-spi-nor-flash-chips-in-parallel-possible/30529/4
Two SPI NOR Flash chips in parallel, possible? - Hardware Questions and Recommendations - OpenWrt Forum
(But use NOR SPI Flash instead of SD-Card)

https://forum.openwrt.org/t/two-spi-nor-flash-chips-in-parallel-possible/30529
https://www.digikey.tw/zh/schemeit/project/detail/huzzah-cc3000-wifi-breakout-1469-FASS1RO100S0

https://e2e.ti.com/support/microcontrollers/msp-low-power-microcontrollers-group/msp430/f/msp-low-power-microcontroller-forum/708763/msp432e401y-spi-flash-nor-and-ti-rtos-filesystem-support

https://forum.openwrt.org/t/aruba-ap-105-spi-flash-rpi/61215

Thursday, December 14, 2023

Text-To-Speech,TTS Semantics modality modus Acoustics Harmony vocal Speech

 https://www.ptw.com/zh-cht/lab/what-is-text-to-speech
https://www.iqt.ai/tts-list
TTS speech synthesis: TTS audio quality and samples
雅婷 (Yating) text-to-speech

https://www.researchgate.net/figure/Main-causes-of-acoustic-and-linguistic-variation-in-speech_fig1_221483511
Main causes of acoustic and linguistic variation in speech. | Download Scientific Diagram
(PDF) Robust methods in automatic speech recognition and understanding

https://ecampusontario.pressbooks.pub/essentialsoflinguistics2/chapter/3-1-modality/
Figure 3.1. Steps in the transmission of a linguistic signal from one person to another.
Spoken and signed languages
The modality of spoken languages, such as English and Cantonese, is vocal, because they are articulated with the vocal tract; acoustic, because they are transmitted by sound waves; and auditory, because they are received and processed by the auditory system. This modality is often shortened to vocal-auditory, leaving the acoustic nature of the signal implied, since that is the ordinary input to the auditory system.

//////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////
http://mirlab.org/jang/books/audiosignalprocessing/humanVoiceProduction.asp?title=3-3%20Human%20Voice%20Production%20(%A4H%C1n%AA%BA%B2%A3%A5%CD)&language=chinese
3-3 Human Voice Production (人聲的產生)
Example Programs  How to get the code https://picture.iczhiku.com › SHKTqsDpLSPulNNb (PDF) 3-3 Human Voice Production (人聲的產生). The procedure from human voice production to voice recognition involves the following steps: 1. Rapid open and close ...

1. Utility Toolbox 2. DCPR Toolbox 3. Audio Processing Toolbox 4. ASR Toolbox (For speech recognition only) 5. Melody Recognition Toolbox (For melody recognition only)
//////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////
https://picture.iczhiku.com/resource/eetop/SHKTqsDpLSPulNNb.pdf

picture.iczhiku.com/resource/eetop pdf

• http://www.phys.unsw.edu.au/~jw/dB.html
Introduction to the definition of decibels for measuring the energy/volume of speech/audio signals.
• http://www.phys.unsw.edu.au/~jw/hearing.html
Introduction (including interactive demos) to curves of equal loudness.
• http://www.phys.unsw.edu.au/music/
Homepage for "Music Acoustics".
• http://www.phys.unsw.edu.au/~jw/musFAQ.html
FAQ for "Music Acoustics".
• http://www.wotsit.org
File formats for various kinds, including audio and music.
• http://www.speech.cs.cmu.edu/comp.speech/index.html
FAQ for the newsgroup "Comp.Speech".
• http://www.bdti.com/faq/dsp_faq.htm
FAQ for the news group "Comp.DSP".
• http://www.harmony-central.com/Effects/effects-explained.html

Introduction to audio effects, including many examples.

Chapter 2: MATLAB Basics
It is very handy to use MATLAB for audio signal processing. To get started with MATLAB, please read the following tutorials on MATLAB basics directly.
• MATLAB Primer by Sigmon (in English)
• 02-初探MATLAB.pdf (Examples) (in Chinese)
///////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////

https://en.wikipedia.org/wiki/Speech_synthesis
https://en.wikipedia.org/wiki/FreeTTS

Friday, May 12, 2023

voice recognition speech features selected sound database feature code Phonetic Alphabet transcription Speech coding Encoder

 Speech codecs Audio codecs PCM DPCM ADPCM CVSDM ATC SBC APC Adaptive Differential Pulse Code Modulation

 https://en.wikipedia.org/wiki/Category:Speech_codecs

 https://www.cs.columbia.edu/~hgs/audio/codecs.html

 https://sip-systems.com/f/voip-audio-codecs/

 https://en.wikipedia.org/wiki/Code-excited_linear_prediction

 voice recognition speech features selected sound database feature code Phonetic Alphabet transcription Speech coding  Encoder

 https://www.researchgate.net/figure/Semantic-levels-of-a-speech-signal_fig1_307889083

A study of transformer-based end-to-end speech recognition system for Kazakh language | Scientific Reports

Tuesday, April 26, 2022

GPIB-488

 This is a single-purpose program created to manage an experiment on a custom apparatus. Its goal is to take measurements of the values needed to compute the Seebeck coefficient.

 https://github.com/pinkavaj/seebrez/blob/master/GPIB-488/Language%20Interfaces/Delphi/GPIB.PAS

 seebrez/GPIB-488/Language Interfaces/Delphi/

 https://www.ni.com/zh-tw/support/downloads/drivers/download.ni-488-2.html#442610

  NI-488.2 with LabVIEW

 

 

 

 

 

 

 

 

https://github.com/pinkavaj/seebrez/tree/master/TPCM.

Wednesday, February 2, 2022

IMA ADPCM codec for MSACM (Codec IMA ADPCM pour MSACM)

 ADVAPI32.dll
GDI32.dll
KERNEL32.dll
USER32.dll
WINMM.dll

imaadp32.acm
https://docs.microsoft.com/zh-tw/windows/win32/multimedia/microsoft-corporation-product-identifiers
https://docs.microsoft.com/en-us/windows/win32/api/msacm/nf-msacm-acmdriverenum
https://docs.microsoft.com/en-us/windows/win32/xaudio2/adpcm-overview
https://docs.microsoft.com/en-us/windows/win32/directshow/choosing-a-compression-filter

NAudioDemo - GitHub
https://github.com/naudio/NAudio/blob/master/Docs/EnumerateAcmDrivers.md
naudio/NAudio: Audio and MIDI library for .NET - GitHub
enumerating ACM file codec windows List all installed multimedia codecs
https://social.technet.microsoft.com/Forums/Lync/en-US/584e73b8-7a4b-4e39-b2cc-51bbda1875a9/windows-media-player-will-not-play-regular-codecs-such-as-mp3-wmv-and-avi?forum=w7itpromedia

ADVAPI32.dll WINMM.dll codec Windows Media Player 12 Codec problem - TechNet Microsoft

7 Programs to Check Installed Audio and Video Codecs On Your Computer

 

 

https://social.technet.microsoft.com/Forums/windows/en-US/921d4e11-40b9-4aad-ba41-fb8f3b698310/windows-media-player-12-playing-mpeg2s-extremely-loud?forum=w7itpromedia