  • Application Deadline: Anytime
  • Duration: 28 days
edX, an online learning platform trusted by over 12 million users, offers the Speech Recognition Systems Certificate in collaboration with MicrosoftX. Learn about the components of a modern automatic speech recognition (ASR) system as we cover fundamental acoustic and linguistic theory, data preparation, language modeling, acoustic modeling, and decoding.

About

The Speech Recognition Systems Certificate from edX is offered in partnership with MicrosoftX.

Developing and understanding Automatic Speech Recognition (ASR) systems is an interdisciplinary activity that draws on expertise in linguistics, computer science, mathematics, and electrical engineering.

When a person speaks a word, their voice produces a time-varying pattern of sounds. These sounds are waves of pressure that propagate through the air. The sounds are captured by a sensor, such as a microphone or microphone array, and turned into a sequence of numbers representing the pressure change over time. The automatic speech recognition system converts this time-pressure signal into a time-frequency-energy signal. The system has been trained on a curated set of labeled speech sounds, and it labels the sounds it is presented with. These acoustic labels are combined with a model of word pronunciation and a model of word sequences to create a textual representation of what was said.
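
The conversion from a time-pressure signal to a time-frequency-energy signal can be illustrated with a short-time Fourier transform. The following is a minimal sketch using NumPy, not the course's own lab code: the function name `spectrogram`, the 25 ms frame length, the 10 ms hop, and the 16 kHz sample rate are all assumptions chosen for illustration.

```python
import numpy as np


def spectrogram(signal, frame_len=400, hop=160):
    """Return a (frames, frequency bins) array of log energies.

    frame_len and hop are in samples; 400 and 160 correspond to
    25 ms and 10 ms at an assumed 16 kHz sample rate.
    """
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    # Slice the waveform into overlapping, windowed frames.
    frames = np.stack([
        signal[i * hop : i * hop + frame_len] * window
        for i in range(n_frames)
    ])
    # Energy in each frequency bin of each frame.
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2
    # A small constant avoids taking the log of zero.
    return np.log(power + 1e-10)


# Synthetic stand-in for a microphone recording: one second of a 1 kHz tone.
sample_rate = 16000
t = np.arange(sample_rate) / sample_rate
tone = np.sin(2 * np.pi * 1000 * t)

spec = spectrogram(tone)
peak_bin = int(spec.mean(axis=0).argmax())
peak_hz = peak_bin * sample_rate / 400
print(spec.shape, peak_hz)
```

Each row of the result describes one short slice of time and each column one frequency band, so the pure tone shows up as a single strong column. Real systems typically add further steps, such as mel filtering, before the acoustic model sees the features.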

Instead of exploring one part of this process deeply, this course is designed to give an overview of the components of a modern ASR system. In each lecture, we describe a component's purpose and general structure. In each lab, the student creates a functioning block of the system. At the end of the course, we will have built a speech recognition system almost entirely out of Python code.

Detailed Programme Facts

  • Deadline and start date: A student can apply at any time for this programme; there is no deadline.
  • Programme intensity: Part-time
    • Average part-time duration: 28 days
    • Part-time variant: Flexible
    • Duration description: 4 weeks, 5 to 6 hours per week

  • Languages
    • English
  • Delivery mode
    Online

Programme Structure

What you'll learn:
  • Fundamentals of Speech Recognition
  • Basic Signal Processing for Speech Recognition
  • Acoustic Modeling and Labeling
  • Common Algorithms for Language Modeling
  • Decoding Acoustic Features into Speech

English Language Requirements

This programme may require students to demonstrate proficiency in English.

Academic Requirements

Prerequisites:

  • Some Python experience
  • Basic Machine Learning principles
  • Knowledge of probability and statistics

Tuition Fee

  • International: 99 USD for the full programme (28 days)
  • National: 99 USD for the full programme (28 days)

Funding

Check the programme website for information about funding options.

