Speech Recognition in Pure Language Processing

Machine Learning

Speech Recognition in Pure Language Processing

hhhhm

2023年12月7日

Speech Recognition in Pure Language Processing

[ad_1]

Speech recognition is an interdisciplinary sub-field in pure language processing. It makes use of a sub-field of laptop science and computational linguistics. We additionally know speech recognition’s with numerous names like speech to textual content, laptop speech recognition, and automated speech recognition.

What are totally different Fashions, strategies, and algorithms utilized in Speech Recognition?

There are numerous modelling methods utilized in speech recognition like acoustic modelling, doc classification, statistical machine translation and language modelling. Most of contemporary common objective speech recognition’s system are primarily based on Hidden Markov Fashions.

Additionally See: What are main elements of machine studying?

Hidden Markov Fashions (HMMs)

Most of contemporary general-purpose speech to textual content are primarily based on Hidden Markov Fashions. The rationale why Hidden Markov Fashions is fashionable as they are often mechanically educated, easy to make use of and computationally possible to make use of.

The time period hidden is for first order Markov course of behind the commentary. On this case the commentary refers to information which we all know that’s “stroll”, “store” and “clear”. Hidden States are “wet” and “sunny”. We may have an in depth publish on this mannequin.

Hidden Markov Models — Hidden Markov Fashions

Dynamic time warping (DTW) primarily based speech recognition

DTW is an strategy which has been used traditionally used for speech recognition. It’s an algorithm for measuring similarity between two sequences that modify in time or pace.

Neural Networks

Neural networks primarily work as mind through which speech to textual content is detected, the great thing about neural networks is map non-linear relationships. Nonetheless, one drawback for neural networks is information preprocessing, transformation, and dimensionality discount. We’ll talk about neural networks later in publish.

Deep feedforward and recurrent neural networks

Like above however few modifications, we’ll talk about this in one other publish.

Finish to Finish automated speech recognition

Finish to finish fashions collectively study all elements of speech to textual content. An instance is Hey Google and Apple’s siri. Most on this in later publish.

Software of Speech to textual content

Video Video games
Digital assistant
Transcription
Car navigation methods
Actual time video captioning
Multi issue authentication
Robotics
Cell phone
IVR (Interactive voice response)
Palms free computing
Emotion recognition

Efficiency metrics for speech to textual content

WER = (no. of substitutions + no. of deletions + no. of insertions)/ (no. of substitutions + no. of deletions + no. of insertions + no. of right phrases)

Actual time issue
Single phrase error price (SWER)
Command success price (CSR)

On this publish, you might have seen the introduction of speech to textual content, which has main purposes throughout industries.

[ad_2]