MFCC Collection

Repositories tagged with "mfcc"

numpy-ml (ddbourgin)
RARE · 🔮 Psychic · ★★★ (3/5) · GitPedia #568
Python · attention · bayesian-inference
“Machine learning, in numpy”
⭐ 16.3k stars · 3.8k forks

aubio
UNCOMMON · ⚔️ Fighting · ★★ (2/5) · GitPedia #786
C · analysis · annotation
“a library for audio and music analysis”
⭐ 3.7k stars · 417 forks

audioFlux (libAudioFlux)
UNCOMMON · ⚔️ Fighting · ★★ (2/5) · GitPedia #300
C · audio · audio-analysis
“A library for audio and music analysis, feature extraction.”
⭐ 3.3k stars · 144 forks

emotion-recognition-using-speech (x4nth055)
UNCOMMON · 🔮 Psychic · ★★ (2/5) · GitPedia #526
Python · deep-learning · emotion-detection
“Building and training Speech Emotion Recognizer that predicts human emotions using Python, Sci-kit learn and Keras”
⭐ 689 stars · 251 forks

NWaves (ar1st0crat)
UNCOMMON · 🔮 Psychic · ★★ (2/5) · GitPedia #373
C# · adaptive-filtering · audio
“.NET DSP library with a lot of audio processing functions”
⭐ 543 stars · 86 forks

spafe (SuperKogito)
COMMON · 🔮 Psychic · ★ (1/5) · GitPedia #585
Python · audio · audio-analysis
“:sound: spafe: Simplified Python Audio Features Extraction”
⭐ 485 stars · 77 forks

Gist (adamstark)
COMMON · 🔥 Fire · ★ (1/5) · GitPedia #179
C++ · audio · audio-analysis
“A C++ Library for Audio Analysis”
⭐ 399 stars · 77 forks

Speech_Signal_Processing_and_Classification (gionanide)
COMMON · 🔮 Psychic · ★ (1/5) · GitPedia #234
Python · classifier · feature-extraction
“Front-end speech processing aims at extracting proper features from short-term segments of a speech utterance, known as frames. It is a prerequisite step toward any pattern recognition problem employing speech or audio (e.g., music). Here, we are interested in voice disorder classification, that is, in developing two-class classifiers that can discriminate between utterances of a subject suffering from, say, vocal fold paralysis and utterances of a healthy subject. The mathematical modeling of the speech production system in humans suggests that an all-pole system function is justified [1-3]. As a consequence, linear prediction coefficients (LPCs) constitute a first choice for modeling the magnitude of the short-term spectrum of speech. LPC-derived cepstral coefficients are guaranteed to discriminate between the system (e.g., vocal tract) contribution and that of the excitation. Taking into account the characteristics of the human ear, the mel-frequency cepstral coefficients (MFCCs) emerged as descriptive features of the speech spectral envelope. Similarly to MFCCs, the perceptual linear prediction coefficients (PLPs) could also be derived. The aforementioned, so to speak, traditional features will be tested against agnostic features extracted by convolutive neural networks (CNNs) (e.g., auto-encoders) [4]. The pattern recognition step will be based on Gaussian Mixture Model based classifiers, K-nearest neighbor classifiers, Bayes classifiers, as well as Deep Neural Networks. The Massachusetts Eye and Ear Infirmary Dataset (MEEI-Dataset) [5] will be exploited. At the application level, a library for feature extraction and classification in Python will be developed. Credible publicly available resources, such as KALDI, will be used toward achieving our goal. Comparisons will be made against [6-8].”
⭐ 257 stars · 63 forks

SPTK (sp-nitech)
COMMON · 🔥 Fire · ★ (1/5) · GitPedia #643
C++ · audio-processing · cepstrum
“A suite of speech signal processing tools”
⭐ 246 stars · 28 forks

LibrosaCpp (ewan-xu)
COMMON · 🔥 Fire · ★ (1/5) · GitPedia #206
C++ · eigen · librosa
“LibrosaCpp is a C++ implementation of librosa to compute short-time Fourier transform coefficients, mel spectrogram or MFCC”
⭐ 239 stars · 54 forks

pyAudioProcessing (jsingh811)
COMMON · 🔮 Psychic · ★ (1/5) · GitPedia #642
Python · audio-data · audio-files
“Audio feature extraction and classification”
⭐ 226 stars · 41 forks

Voice-based-gender-recognition (SuperKogito)
COMMON · 🔮 Psychic · ★ (1/5) · GitPedia #014
Python · data-science · gaussian-mixture-models
“:sound: :boy: :girl: Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)”
⭐ 221 stars · 65 forks

kaldifeat (csukuangfj)
COMMON · 🔥 Fire · ★ (1/5) · GitPedia #674
C++ · cpp · fbank
“Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API”
⭐ 213 stars · 39 forks

diffsptk (sp-nitech)
COMMON · 🔮 Psychic · ★ (1/5) · GitPedia #973
Python · cepstrum · cqt
“A differentiable version of SPTK”
⭐ 201 stars · 20 forks

MevonAI-Speech-Emotion-Recognition (SuyashMore)
COMMON · ⚔️ Fighting · ★ (1/5) · GitPedia #433
C · artificial-intelligence · colab-notebook
“Identify the emotion of multiple speakers in an Audio Segment”
⭐ 179 stars · 46 forks

subsync (tympanix)
COMMON · 🔮 Psychic · ★ (1/5) · GitPedia #337
Python · delay · fix
“Synchronize your subtitles using machine learning”
⭐ 158 stars · 15 forks

speech-emotion-recognition (amanbasu)
COMMON · 📦 Normal · ★ (1/5) · GitPedia #213
Jupyter Notebook · deep-learning · emotion
“Detecting emotions using MFCC features of human speech using Deep Learning”
⭐ 132 stars · 38 forks

AcousticKeyBoard-Web (ZhuoZhuoCrayon)
COMMON · 🔮 Psychic · ★ (1/5) · GitPedia #658
Python · deep-learning · django
“❓ Acoustic keyboard | A wild idea: build a "toy" that can tell which key was pressed from the sound of the keystroke, while learning signal processing / deep learning / Android / Django.”
⭐ 88 stars · 6 forks