https://www.researchgate.net/publication/221258872_a_simple_but_effective_approach_to_speaker_tracking_in_broadcast_news
https://www.researchgate.net/figure/automatic-segmentation-of-the-audio-recorded-in-the-cafeteria-noisy-environment-by_fig2_323155847
https://genekogan.com/works/field-rec-navigator/
Visualizing my field recordings
https://www.jyu.fi/hytk/fi/laitokset/mutku/en/research/materials/mirtoolbox
automatically segmented the raw recordings segmentation algorithm
https://sourceforge.net/projects/supercollider/
https://en.wikipedia.org/wiki/Principal_component_analysis
https://en.wikipedia.org/wiki/Music_information_retrieval
https://lgm.fri.uni-lj.si/research/segmentation-of-field-recordings/
Segmentation of field recordings — LGM
https://yaiglobal.com/index.php/component/k2/item/5-audio-segmentation
https://www.researchgate.net/figure/audio-onset-segmentation-dashed-lines-variable-window-length-segmentation-empty_fig1_252187078
http://recherche.ircam.fr/equipes/temps-reel/audio-mosaicking/
Real-Time Audio Mosaicking
MFCC based Frame by Frame Audio Mosacing MEL-frequency cepstrum coefficients (MFCC).
Semi-supervised feature selection for audio classification based on constraint compensated Laplacian score | EURASIP Journal on Audio, Speech, and Music Processing | Full Text
figure 5 Accuracy as a function of the number of selected features
https://asmp-eurasipjournals.springeropen.com/articles/10.1186/s13636-016-0086-9
2024年1月15日 星期一
speech segmentation speech segmenting algorithm
audio Packages DLL music Audio libraries library
SEARCH Packages Linux Unix
https://slackbuilds.org/result/?search=audio&sv=15.0
https://pkgs.org/download/libopenshot-audio
https://en.wikipedia.org/wiki/Category:Audio_libraries
https://en.wikipedia.org/wiki/Category:Video_game_music_technology
category is Audio library.Pages in category "Audio libraries"
BASS
ClanLib
DirectSound
Enlightened Sound Daemon
FMOD
JACK Audio Connection Kit
Libavcodec
Miles Sound System
Open Sound System
OpenAL
OpenSL ES
PulseAudio
Raylib
Simple DirectMedia Layer
UFMOD
Audio libraries Raylib ClanLib Libavcodec PulseAudio
https://packages.altlinux.org/en/p10/srpms/libopenshot-audio/
https://ru.wikipedia.org/wiki/%D0%9A%D0%B0%D1%82%D0%B5%D0%B3%D0%BE%D1%80%D0%B8%D1%8F:%D0%90%D1%83%D0%B4%D0%B8%D0%BE%D0%B1%D0%B8%D0%B1%D0%BB%D0%B8%D0%BE%D1%82%D0%B5%D0%BA%D0%B8
libopenshot-audio JUCE audio library
audio/opus-tools
https://en.wikipedia.org/wiki/Opus_(audio_format)
https://wiki.xiph.org/Opus-tools
libs For Qt5. pulseaudio qt
https://archlinux.org/packages/extra/x86_64/pulseaudio-qt/
https://packages.fedoraproject.org/pkgs/pulseaudio-qt/pulseaudio-qt-qt5/
https://github.com/OpenShot/libopenshot-audio
OpenShot Audio Library libopenshot-audio OpenShot Audio Library (libopenshot-audio) is a free, open ...
GitHub https://github.com › OpenShot › libope...
OpenShot Audio Library (libopenshot-audio)
https://www.haskell.org/
haskell Packages bindings audio haskell bindings audio
https://hackage.haskell.org/package/htaglib
https://hackage.haskell.org/package/jack
Octave packages audio Audio and MIDI Toolbox for GNU Octave. SHA256
https://en.wikipedia.org/wiki/UFMOD
pectrum-devel pkgs top-level/all-packages
https://spectrum-os.org/lists/archives/spectrum-devel/c22a62456db/s/?b=pkgs/top-level/all-packages.nix
https://discourse.julialang.org/t/package-trouble-again/104977
https://ctepp.calstate.edu/tlab-change-packages
Steam Audio supports the following platforms:
https://valvesoftware.github.io/steam-audio/doc/capi/getting-started.html
Steam Audio supports the following platforms
Steam Broadcasting
Steam Support
https://help.steampowered.com › view
Steam Broadcasting is currently supported by the following browsers: Steam Client; Google Chrome (version 39+); Apple Safari (version 8+ on macOS); Internet ...
Modern audio compressioninternet. Opus audio format Opus lossy audio coding format Xiph.Org Foundation standardized code speech
https://en.wikipedia.org/wiki/Opus_(audio_format)
Opus (audio format) - Wikipedia
Opus is a lossy audio coding format developed by the Xiph.Org Foundation and standardized by the Internet Engineering Task Force, designed to efficiently code speech and general audio in a singl
xiph/opus: Modern audio compression for the internet.
Opus Codec
https://opus-codec.org/
Opus Interactive Audio Codec
Overview
Opus is a totally open, royalty-free, highly versatile audio codec. Opus is unmatched for interactive speech and music transmission over the Internet, but is also intended for storage and streaming applications. It is standardized by the Internet Engineering Task Force (IETF) as RFC 6716 which incorporated technology from Skype’s SILK codec and Xiph.Org’s CELT codec.
Technology
Opus can handle a wide range of audio applications, including Voice over IP, videoconferencing, in-game chat, and even remote live music performances. It can scale from low bitrate narrowband speech to very high quality stereo music. Supported features are:
Bitrates from 6 kb/s to 510 kb/s
Sampling rates from 8 kHz (narrowband) to 48 kHz (fullband)
Frame sizes from 2.5 ms to 60 ms
Support for both constant bitrate (CBR) and variable bitrate (VBR)
Audio bandwidth from narrowband to fullband
Support for speech and music
Support for mono and stereo
Support for up to 255 channels (multistream frames)
Dynamically adjustable bitrate, audio bandwidth, and frame size
Good loss robustness and packet loss concealment (PLC)
Floating point and fixed-point implementation
2024年1月13日 星期六
rtsp Streaming Media streaming library LIVE555 Media Server Proxy Server HLS Proxy vobStreamer streaming DVD RTP/RTCP/RTSP
https://en.wikipedia.org/wiki/Real-Time_Streaming_Protocol
LIVE555 Streaming Media
This code forms a set of C++ libraries for multimedia streaming, using open standard protocols (RTP/RTCP, RTSP, SIP). These libraries - which can be compiled for Unix (including Linux and Mac OS X), QNX (and other POSIX-compliant systems) - can be used to build streaming applications. The libraries are already being used to implement applications such as the "LIVE555 Media Server", "LIVE555 Proxy Server", and "LIVE555 HLS Proxy" and "vobStreamer" (for streaming DVD content using RTP/RTCP/RTSP). The libraries can also be used to stream, receive, and process MPEG, H.265, H.264, H.263+, DV or JPEG video, and several audio codecs. They can easily be extended to support additional (audio and/or video) codecs, and can also be used to build basic RTSP or SIP clients and servers, and have been used to add streaming support to existing media player applications, such as "VLC" and "MPlayer". (For some specific examples of how these libraries can be used, see the test programs below.)
https://girishjoshi.io/post/stream-a-video-over-rtsp-using-live555mediaserver/
串流伺服器特性剖析 - 國立交通大學國立陽明交通大學機構典藏 https://ir.nctu.edu.tw › bitstreamPDF由 吳宗修 著作 · 2006 — A streaming server will consists of three important modules, they are " Set up a standard connection procedure "," RTSP Signaling Negotiation " and " packet ...
https://github.com/bluenviron/mediamtx
RTSP Pull - OvenMediaEngine
https://airensoft.gitbook.io/ovenmediaengine/live-source/rtsp-pull-beta
https://gstreamer.freedesktop.org/documentation/gst-rtsp-server/rtsp-server.html?gi-language=c
https://github.com/aler9/rtsp-simple-proxy
RTSP/RTP streaming support for MPlayer
http://www.live555.com/mplayer/
RTSP SDK Libraries for Windows, .NET 6+, .NET Framework, C#, VB, C/C++, and Python | LEADTOOLS
https://www.leadtools.com/sdk/multimedia/streaming/rtsp
Managed Media Aggregation using Rtsp and Rtp - CodeProject
https://www.codeproject.com/Articles/507218/Managed-Media-Aggregation-using-Rtsp-and-Rtp
RTSP & RTP Client, Broadcaster & Server Library | VASTreaming
https://www.vastreaming.net/rtsp-library.html
rtsp RTP Streaming Media streaming library
https://github.com/ekumenlabs/AndroidStreamingClient/blob/master/android_streaming_client/src/main/java/com/c77/androidstreamingclient/lib/rtp/RtpMediaDecoder.java
android streaming client lib
DataPacketTracer
MediaExtractor
RtpMediaDecoder
RtpMediaExtractor
streaming data packet tracer decoder extractor Naishy/rtpsplit: RTP stream extractor GitHub - Linaro/OpenCSD: CoreSight trace stream decoder
rtp streaming data extractor
https://wiki.wireshark.org/rtp_statistics
speech audio processing coding enhancement audio library adpcm acelp pulse density
speech Audio codecs adpcm acelp Audio codecs
Vocoders, Audio Codecs and Speech Compression Software
GAO Research
http://www.gaoresearch.com › products
ITU-T Vocoder Standards for Speech Processing Software and Audio Processing Codecs ; ITU-T G.723.1, 6.3 and 5.3 kbit/s, MP-MLQ, and ACELP based codec ; ITU-T G.
Comparison of audio coding formats - Wikipedia
https://en.wikipedia.org/wiki/Comparison_of_audio_coding_formats
TwoCC - MultimediaWiki
https://wiki.multimedia.cx/index.php/TwoCC
The TwoCC is the audio counterpart to the video FourCC. It is the audio format identifier used in the RIFF based multimedia formats by Microsoft (WAV and AVI). The TwoCC is 2 bytes long and stored in little endian format on disk. You can register your TwoCC with Microsoft but it seems that only some companies perform this process.
https://wiki.multimedia.cx/index.php/Category:Audio_Codecs
G.7xx: Audio (Voice) Compression Protocols (CODEC) (PDF)
Transcoding of Voice Codecs G.711 to G.729 and ...
PDF iTu T G.7xx Standards for Speech Codec
https://academic-accelerator.com/encyclopedia/zh/g-722
g.722 G 722: 最新的百科全書、新聞、評論和研究
https://www.wikiwand.com/en/List_of_video_compression_formats
List of codecs - Wikiwand
Non-compression Linear pulse-code modulation (LPCM, generally only described as PCM) is the format for uncompressed audio in media files and it is also the standard for CD-DA; note that in computers, LPCM is usually stored in container formats such as WAV, AIFF, or AU, or as raw audio format, although not technically necessary. FFmpeg Pulse-density modulation (PDM) Direct Stream Digital (DSD) is standard for Super Audio CD foobar2000 Super Audio CD Decoder (based on MPEG-4 DST reference decoder) FFmpeg (based on dsd2pcm) Pulse-amplitude modulation (PAM)
2024年1月12日 星期五
gsm audio speech telecommunications technology Audio Compression , communications system voice codec VoIP speech pcm amr-wb opus SPEEX
//////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////
bass library g729 g719 g722 G.726 Code-excited linear prediction
speech telecommunications technology Audio Compression
https://github.com/sippy/libg722
AES E-Library » Real-Time CELP Speech Coding in a Voice Response Environment
https://www.aes.org/e-lib/online/browse.cfm?elib=5530
CELP speech Code-excited linear prediction
https://en.wikipedia.org/wiki/RTP_payload_formats
g729a acelp internet audio stream message rtp payload format for the g.729.1 audio codec rfc 4749
https://en.wikipedia.org/wiki/Category:Speech_codecs
https://en.wikipedia.org/wiki/CELT
https://en.wikipedia.org/wiki/G.729.1
https://en.wikipedia.org/wiki/Code-excited_linear_prediction
https://en.wikipedia.org/wiki/Speech_coding
https://github.com/sippy/libg722
https://github.com/wisekrakr/CommUniWise
https://github.com/wisekrakr/SIP_dev_pushToTalk
ITU G.722 Voice message SIP RFC 3261 github
https://github.com/ttsou/openbts-p2.8/tree/master
https://datatracker.ietf.org/doc/rfc6366/
github SIP AudioFrame
https://github.com/onmyway133/awesome-voip
https://github.com/sipsorcery-org/sipsorcery/issues/914
https://github.com/AGProjects/sipclients/blob/master/sip-audio-session
https://github.com/baresip/baresip/blob/main/test/call.c
https://github.com/pjsip/pjproject/blob/master/pjsip-apps/src/samples/siprtp.c
g729a acelp internet audio stream message
rtp payload format for the g.729.1 audio codec rfc 4749
https://en.wikipedia.org/wiki/RTP_payload_formats
g729a acelp rtp g.729.1 rfc4749 pdf
JT-G729 とビット列互換な 8-32kbit 一般社団法人情報通信技術委員会 https://www.ttc.or.jp › files › JT-G729.1v5.pdf PDF 2013年11月14日 — (2) TTC標準JT-G729.1は、ITU-T勧告G.729.1に ... ACELP を用いた音声符号化方式. (2) TTC標準JT-G729付属資料A. 低 ...
RTP Payload for DTMF Digits, Telephony Tones, and. Telephony Signals. [RFC 4749] IETF RFC 4749 (2006), RTP Payload Format for the G.729.1 Audio
g729a acelp rtp g.729.1 rfc4749 pdf g.729.1 audio acelp android usacdec_acelp acelp audio stream frame
aaa android rtp audio stream frame mediacodec messenger stackoverflow mediacodec decode aac audio chunks from rtsp and play
https://stackoverflow.com/questions/48602108/mediacodec-decode-aac-audio-chunks-from-rtsp-and-play
https://developer.android.com/reference/android/media/MediaCodec.html
https://github.com/imansaleh16/Stack-Overflow-Tags-Communities/blob/master/dataset/E_llda
https://github.com/pedroSG94/RootEncoder/wiki
////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////
Network ProtocolsThe following network protocols are supported for audio and video playback: RTSP (RTP, SDP) HTTP/HTTPS progressive streaming HTTP/HTTPS live streaming draft protocol : MPEG-2 TS media files only Protocol version 3 (Android 4.0 and above) Protocol version 2 (Android 3.x) Not supported before Android 3.0
https://github.com/fyhertz/libstreaming/blob/master/src/net/majorkernelpanic/streaming/audio/AACStream.java
RTP aac android.media.MediaCodec RTP STREAM github wire
https://github.com/ekumenlabs/AndroidStreamingClient/blob/master/android_streaming_client/src/main/java/com/c77/androidstreamingclient/lib/rtp/RtpMediaDecoder.java
https://github.com/ekumenlabs/AndroidStreamingClient/tree/master
RTP aac android.media.MediaCodec RTP STREAM github wire
https://www.codeproject.com/Articles/797537/Making-an-Audio-Spectrum-analyzer-with-Bass-dll-Cs
https://delphi-lab.ucoz.ru/load/17-1-0-32
Главная » Файлы » VCL » Sound and Multimedia
Audio Tools Library v.1.4
[ Скачать с сервера (95.6 Kb) ] 10.07.2008, 02:19
By J. Faul. ATL - programming tools for manipulating with some audio file formats. The pack uncludes several components described below:
* MPEGaudio - for manipulating with MPEG audio file information,
* ID3v1 - for manipulating with ID3v1 tags,
* ID3v2 - for manipulating with ID3v2 tags,
* WAVfile - for extracting information from WAV file header,
* OggVorbis - for extracting information from Ogg Vorbis file header,
* MPEGplus - for manipulating with MPEGplus file information,
* TwinVQ - for extracting information from TwinVQ file header,
* Monkey - for manipulating with Monkey's Audio file information.
Live555 RTSP Server on Android rtsp server on android
https://github.com/papan01/Live555-server-android
http://hank5000.github.io/blog/2015/06/24/live555-rtsp-server-on-android/
developer.android.com/reference/androidx/media3/exoplayer/rtsp/reader/rtpac3reader
https://developer.android.com/reference/kotlin/androidx/media3/exoplayer/rtsp/reader/RtpPcmReader
https://developer.android.com/reference/kotlin/androidx/media3/exoplayer/rtsp/reader/RtpAc3Reader
https://developer.android.com/media/media3/exoplayer/rtsp?hl=zh-tw
https://developer.android.com/reference/kotlin/androidx/media3/exoplayer/rtsp/reader/RtpAc3Reader
https://developer.android.com/reference/androidx/media3/exoplayer/rtsp/package-summary
Bass Audio Library https://github.com/topics/bass-dll base.dll
https://www.codeproject.com/Articles/2848/nBASS-A-sound-libary-for-NET
Un4seen Developments
https://www.un4seen.com/
BASS is an audio library for use in Win32, MacOS, Linux and PocketPC software. It's purpose is to provide the most powerful and efficient (yet easy to use), sample, stream, MOD music, and recording functions. This library was written by Ian Luck, over at Un4seen Developments. New features include Add-on plugin system, MOD position & syncing in bytes, Support for AIFF files, Floating-point sampling, More options, and More.The BASS audio library is used in MediaPortal for the default BASS audio player.
https://github.com/topics/bass-library
https://en.wikipedia.org/wiki/Bass
https://en.wikipedia.org/wiki/AIMP
BASS audio library v2.4 PureBasic 4.20 includes. - PureBasic Forums - English
https://github.com/ans-hub/audio_out
https://www.team-mediaportal.com/wiki/display/glossary/BASS+Audio+Library
http://bass.radio42.com/
bass.dll play delphi
https://itecnote.com/tecnote/delphi-load-bass-dll-and-play-mp3/
https://github.com/Zaflis/nxpascal/tree/master
https://github.com/Zaflis/nxpascal/blob/master/src/Bass.pas
Delphi, C++, VB - BASS Audio Recognition Library
https://www.3delite.hu/Object%20Pascal%20Developer%20Resources/bassaudiorecognitionlibrary.html
Bass.BASS_ChannelPlay Method
https://github.com/ManagedBass/ManagedBass/blob/master/src/AddOns/BassDShow/BassDShow.cs
Un4seen Developments
https://github.com/DragonMinded/xmplay
https://github.com/DragonMinded/libnaomi
http://support.xmplay.com/
http://bass.radio42.com/help/html/743b046b-0c42-71a0-b613-799f5f0450b9.htm
http://docwiki.embarcadero.com/RADStudio/Athens/en/Libraries_and_Packages_(Delphi)
https://delphimagic.blogspot.com/2013/05/escuchar-la-radio-por-streaming.html
https://stackoverflow.com/questions/8964488/delphi-load-bass-dll-and-play-mp3
BASS.DLL GITHUB
https://github.com/ManagedBass/ManagedBass/tree/master/src/Bass/Shared/Bass
Radio streaming with bass.dll
https://autoit.de/thread/25624-radio-streaming-with-bass-dll/
IPHLPAPI.dll IPHLPAPI ip helper api Iphlpapi.h header
WSOCK32.dll
wasapi Windows Audio Session API
base-dll basswasapi.dll
https://superblt.znix.xyz/
https://superblt.znix.xyz/doc/xaudio/
XML Tweaker - SuperBLT
SuperBLT cross-platform audio API
base.dll script plugin header
https://wiki.mairlist.com/faq:bass-plugins
autohotkey
powerbasic base.dll script plugin header powerbasic Sound BASS.DLL API encapsulation
foobar components
powerbasic third-party-addons
sound-bass-dll-api-header-file
cwmp-data-models客戶端設備廣域網路管理傳輸協定與裝置數據模組 网管协议数据模型定义,cwmp-data-models
2024年1月6日 星期六
DY-SV17F W0974 dy1703A flash 32Mbit mp3 music plaer memory Winbond's W25X and W25Q SpiFlash® Multi-I/O Memories feature the popular Serial Peripheral Interface (SPI), densities
DY-SV17F Audio Module Mini MP3 Player IO Trigger USB ...
DY-SV17F
DY-SV17F Audio Module Mini MP3 Player IO Trigger USB Download Flash Voice Module ; Supports Recording FunctionYes ; Display SizeNone ; PackageYes ; Mode
DY-SV17F voice module integrates IO segment trigger, UART serial port control, ONE_line single bus serial port control, standard MP3 and other 7 working modes; onboard 5W Class D power amplifier can directly drive 4Ω 3~5W speaker; support MP3, WAV decoding format, onboard 32Mbit (4MByte) flash storage audio file, can connect to the computer to update audio files through USB data cable.
Support MP3 and WAV decoding formats.
Support sampling rate (KHz): 8/11.025/12/16/22.05/24/32/44.1/48.
24-bit DAC output, dynamic range support 90dB, signal-to-noise ratio support 85dB.
Onboard 32Mbit (4MByte) flash storage, you can connect the computer to update the audio file through the USB data cable.
Comes with 5W class D power amplifier, can directly drive 4Omega, 3~5W speakers.
UART serial port control voice broadcast function, can control playback, pause, song selection, volume addition and subtraction, etc., the largest selection of 65,535 songs, baud rate of 9600 bps.
Support IO trigger playback function, 8 IO ports trigger 8 tracks or 8 IO ports to trigger 255 tracks.
Support One_line single bus serial port control, can control playback, pause, song selection, volume addition and subtraction and other functions.
Support 3 configuration IO for up to 7 working mode selection.
Notice:
1."Key combination playback" refers to the restoration of the original high level after io0-io7 output the corresponding level, similar to the key trigger once; "Level combination playback" refers to the io0-io7 output of the corresponding level to maintain the same level.
3.The difference between "I/O combination (independent) mode 0" and "I/O combination (independent) mode 1" is that the former mode continues to play the current track until the end after releasing the level, while the latter mode immediately stops playing the track after releasing the level.
Package Includes: DY-SV17F MP3 Voice Module
25Q32 SPI NOR Flash
BY25Q32BS 32M-bit Serial Peripheral Interface(SPI) Flash memory
https://www.winbond.com/hq/product/code-storage-flash-memory/serial-nor-flash/?__locale=en
Winbond's W25X and W25Q SpiFlash® Multi-I/O Memories feature the popular Serial Peripheral Interface (SPI), densities from 512K-bit to 512M-bit, small erasable ...
25Q16 Serial SPI Serial NOR flash Memory FLASH
MACRONIX MX25V1635FZNI
Flash Memory, Serial NOR, 16 Mbit, 2M x 8bit, SPI, WSON, 8 Pins
https://forum.openwrt.org/t/two-spi-nor-flash-chips-in-parallel-possible/30529/4
Two SPI NOR Flash chips in parallel, possible? - Hardware Questions and Recommendations - OpenWrt Forum
Two SPI NOR Flash chips in parallel, possible? - Hardware Questions and Recommendations - OpenWrt ForumTwo SPI NOR Flash chips in parallel, possible? - Hardware Questions and Recommendations - OpenWrt Forum Two SPI NOR Flash chips in parallel, possible? - Hardware Questions and Recommendation (But use NOR SPI Fash instead SD-Card)
https://forum.openwrt.org/t/two-spi-nor-flash-chips-in-parallel-possible/30529
https://www.digikey.tw/zh/schemeit/project/detail/huzzah-cc3000-wifi-breakout-1469-FASS1RO100S0
https://e2e.ti.com/support/microcontrollers/msp-low-power-microcontrollers-group/msp430/f/msp-low-power-microcontroller-forum/708763/msp432e401y-spi-flash-nor-and-ti-rtos-filesystem-support
https://forum.openwrt.org/t/aruba-ap-105-spi-flash-rpi/61215
2023年12月14日 星期四
Text-To-Speech,TTS Semantics modality modus Acoustics Harmony vocal Speech
https://www.ptw.com/zh-cht/lab/what-is-text-to-speech
https://www.iqt.ai/tts-list
TTS語音合成-TTS 音質與試聽
雅婷文字轉語音
https://www.researchgate.net/figure/Main-causes-of-acoustic-and-linguistic-variation-in-speech_fig1_221483511
Main causes of acoustic and linguistic variation in speech. | Download Scientific Diagram(PDF) Robust methods in automatic speech recognition and understanding
https://ecampusontario.pressbooks.pub/essentialsoflinguistics2/chapter/3-1-modality/
Figure 3.1. Steps in the transmission of a linguistic signal from one person to another.Spoken and signed languagesThe modality of spoken languages, such as English and Cantonese, is vocal, because they are articulated with the vocal tract; acoustic, because they are transmitted by sound waves; and auditory, because they are received and processed by the auditory system. This modality is often shortened to vocal-auditory, leaving the acoustic nature of the signal implied, since that is the ordinary input to the auditory system.
//////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////
http://mirlab.org/jang/books/audiosignalprocessing/humanVoiceProduction.asp?title=3-3%20Human%20Voice%20Production%20(%A4H%C1n%AA%BA%B2%A3%A5%CD)&language=chinese
3-3 Human Voice Production (人聲的產生)
Example Programs 如何取得程式碼 https://picture.iczhiku.com › SHKTqsDpLSPulNNbPDF3-3 Human Voice Production (人聲的產生). The procedure from human voice production to voice recognition involves the following steps: 1. Rapid open and close ...
1. Utility Toolbox 2. DCPR Toolbox 3. Audio Procesing Toolbox 4. ASR Toolbox (For speech recognition only) 5. Melody Recognition Toolbox (For melody recognition only)
//////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////
https://picture.iczhiku.com/resource/eetop/SHKTqsDpLSPulNNb.pdf
picture.iczhiku.com/resource/eetop pdf
• http://www.phys.unsw.edu.au/~jw/dB.html
Introduction to the definition of Decibels for measuring energy/volume ofspeech/audio signals.
• http://www.phys.unsw.edu.au/~jw/hearing.html
Introduction (including interactive demos) to curves of equal loudness.
• http://www.phys.unsw.edu.au/music/
Homepage for "Music Acoustics".
• http://www.phys.unsw.edu.au/~jw/musFAQ.html
FAQ for "Music Acoustics".
• http://www.wotsit.org
File formats for various kinds, including audio and music.
• http://www.speech.cs.cmu.edu/comp.speech/index.html
FAQ for the newsgroup "Comp.Speech".
• http://www.bdti.com/faq/dsp_faq.htm
FAQ for the news group "Comp.DSP".
• http://www.harmony-central.com/Effects/effects-explained.html
Introduction to audio effects, including many examples.Chapter 2: MATLAB BasicsIt is very handy to use MATLAB for audio signal processing. To get started with MATLAB,please read the following tutorials on MATLAB basics directly.• MATLAB Primer by Sigmon (in English)• 02-初探MATLAB.pdf(Examples)(in Chinese)
///////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////////
https://en.wikipedia.org/wiki/Speech_synthesis
https://en.wikipedia.org/wiki/FreeTTS
2023年8月22日 星期二
SRT / WebRTC / RTSP / RTMP / LL-HLS media server and media proxy that allows to read, publish and proxy video and audio streams.
https://github.com/bluenviron/mediamtx
2023年5月12日 星期五
voice recognition speech features selected sound database feature code Phonetic Alphabet transcription Speech coding Encoder
Speech codecs Audio codecs PCM DPCM ADPCM CVSDM ATC SBC APC Adaptive Differential Pulse Code Modulation
https://en.wikipedia.org/wiki/Category:Speech_codecs
https://www.cs.columbia.edu/~hgs/audio/codecs.html
https://sip-systems.com/f/voip-audio-codecs/
https://en.wikipedia.org/wiki/Code-excited_linear_prediction
voice recognition speech features selected sound database feature code Phonetic Alphabet transcription Speech coding Encoder
https://www.researchgate.net/figure/Semantic-levels-of-a-speech-signal_fig1_307889083
A study of transformer-based end-to-end speech recognition system for Kazakh language | Scientific Reports
2022年4月26日 星期二
GPIB-488
This is one-purpose program created for management of experiment on custom aparatus. His goal take measurements of values for computation if Seebeck coefficient
https://github.com/pinkavaj/seebrez/blob/master/GPIB-488/Language%20Interfaces/Delphi/GPIB.PAS
seebrez/GPIB-488/Language Interfaces/Delphi/
https://www.ni.com/zh-tw/support/downloads/drivers/download.ni-488-2.html#442610
NI-488.2 with LabVIEW
https://github.com/pinkavaj/seebrez/tree/master/TPCM.
2022年2月2日 星期三
Codec IMA ADPCM pour MSACM
ADVAPI32.dll
GDI32.dll
KERNEL32.dll
USER32.dll
WINMM.dll
imaadp32.acm
https://docs.microsoft.com/zh-tw/windows/win32/multimedia/microsoft-corporation-product-identifiers
Codec IMA ADPCM pour MSACM
https://docs.microsoft.com/en-us/windows/win32/api/msacm/nf-msacm-acmdriverenum
https://docs.microsoft.com/en-us/windows/win32/xaudio2/adpcm-overview
https://docs.microsoft.com/en-us/windows/win32/directshow/choosing-a-compression-filter
NAudioDemo - GitHub
https://github.com/naudio/NAudio/blob/master/Docs/EnumerateAcmDrivers.md
naudio/NAudio: Audio and MIDI library for .NET - GitHub
enumerating ACM file codec windows List all installed multimedia codecs
https://social.technet.microsoft.com/Forums/Lync/en-US/584e73b8-7a4b-4e39-b2cc-51bbda1875a9/windows-media-player-will-not-play-regular-codecs-such-as-mp3-wmv-and-avi?forum=w7itpromedia
ADVAPI32.dll WINMM.dll codec Windows Media Player 12 Codec problem - TechNet Microsoft
7 Programs to Check Installed Audio and Video Codecs On Your Computer
https://social.technet.microsoft.com/Forums/windows/en-US/921d4e11-40b9-4aad-ba41-fb8f3b698310/windows-media-player-12-playing-mpeg2s-extremely-loud?forum=w7itpromedia