Skip to main content
BirdBrain Technologies
  • Products
    • Want to learn more about our products?
      • Compare Products
      • Product Catalog
      • Try one for 30 days!
      • Request a Quote
      • Help Desk
      • Privacy Policies
    • Hummingbird Bit Robotics Kit
      Grades 4-12
      • Features
      • Buy Now
    • Finch Robot 2.0
      Grades K-College
      • Features
      • Buy Now
    • Owlet Math Tools
      Grades K-5
      • Features
      • Buy Now
    • Electric Motors Catalyst Kit
      Grades K-5
      • Features
      • Buy Now
  • Teach
    • We support teachers so they can inspire students.
      • Classroom Adaptations
      • Educator Community
      • Professional Development
      • Standards
      • Try one for 30 days!
    • Hummingbird Bit Robotics Kit
      • Start Teaching
      • Programming
      • Building
      • Projects
      • Printables
      • Have an older model?
    • Finch Robot 2.0
      • Start Teaching
      • Programming
      • Finch at Home
      • Activities
      • Printables
      • Loan Program
      • Have an older model?
    • Owlet Math Tools
      • Start Teaching
      • Teacher Guides
      • Printables
      • Owlet Apps
  • About Us
    • Mission & History
    • Our Team
    • Research
    • Media Kit
    • Contact Us
  • Resources
    • Blog
    • Teacher Stories
    • Webinars
    • Conference Presentation Support
    • Grant Assistance
    • Reading List
  • Shop
  • Support
BirdBrain Technologies
  • Products
    • Want to learn more about our products?
      • Compare Products
      • Product Catalog
      • Try one for 30 days!
      • Request a Quote
      • Help Desk
      • Privacy Policies
    • Hummingbird Bit Robotics Kit
      Grades 4-12
      • Features
      • Buy Now
    • Finch Robot 2.0
      Grades K-College
      • Features
      • Buy Now
    • Owlet Math Tools
      Grades K-5
      • Features
      • Buy Now
    • Electric Motors Catalyst Kit
      Grades K-5
      • Features
      • Buy Now
  • Teach
    • We support teachers so they can inspire students.
      • Classroom Adaptations
      • Educator Community
      • Professional Development
      • Standards
      • Try one for 30 days!
    • Hummingbird Bit Robotics Kit
      • Start Teaching
      • Programming
      • Building
      • Projects
      • Printables
      • Have an older model?
    • Finch Robot 2.0
      • Start Teaching
      • Programming
      • Finch at Home
      • Activities
      • Printables
      • Loan Program
      • Have an older model?
    • Owlet Math Tools
      • Start Teaching
      • Teacher Guides
      • Printables
      • Owlet Apps
  • About Us
    • Mission & History
    • Our Team
    • Research
    • Media Kit
    • Contact Us
  • Resources
    • Blog
    • Teacher Stories
    • Webinars
    • Conference Presentation Support
    • Grant Assistance
    • Reading List
  • Shop
  • Support
Finch Robot 2.0

l AI

Activity 2: Audio Recognition

Intro to AI and Machine Learning

This is the second activity in the AI with Finch series and assumes you have completed Activity 1: Image Recognition prior to beginning this lesson. An audio recognition model works similarly to an image recognition model, but the model is trained with audio clips instead of images. For a review on how machine learning models work, see Activity 1: Image Recognition.   This activity utilizes the Snap! programming language.  If you are not familiar with Snap!, we recommend you complete our Finch with Snap! tutorials prior to starting this activity.  

Ready To Get Started?

Training your First Audio Recognition Model*

*You will need access to a computer with a microphone to complete this activity.

In this activity, you will train your first Audio Recognition Model using Google’s Teachable Machine. To learn more about Google’s Teachable Machine and how prediction models work, see Activity 1: Image Recognition.

Find a quiet room to complete this activity. Because your audio recognition model will make its predictions using sound, too much background noise will impact its accuracy.

Open Google Teachable Machine in a new tab or window. This tab will need to stay open and visible on your screen while you work. This button will take you directly to start a new Audio Project.

Google Teachable Machine

For background information about AI terms like “class”, visit the first activity in this series, Image Recognition (Step 3). 

This is very similar to the process that you followed to create your Image Recognition model. For this Audio Recognition model, you will need three different classes, including a specific class for background sound. 

Create three different classes in your image recognition model. Name them Background Noise, Stop, and Go.

Next, we will collect audio samples to train the model. Your model will use these samples to learn how to predict which class it hears.

Select the Background Noise class. Before recording your Background Noise audio samples, make sure that the room is relatively quiet. When you’re ready, click the Mic button, select the correct microphone if necessary, and click Record 20 Seconds. Google Teachable Machine will record 20 seconds of audio.

When you have successfully recorded 20 seconds of quiet background noise, click the Extract Sample button. (If you need to record again before extracting samples, you can click the Record 20 Seconds button again.)

Your Background Noise class should now show 20 sound samples.

Click on the pencil next to “Class 2” and rename it “Go”. Click “Add a Class” and rename “Class 3” to “Stop”.

Before you add samples for your two new classes, Go and Stop, you need to change your settings.

Select the Go class and click the Mic button. Then, click the settings gear next to the Record 2 Seconds button.

The default duration is 2 seconds. Change the duration to 1 second. Click Save Settings.

Repeat this step for your Stop class.

Next, add audio samples for your Go class. Click the Mic button and, when you are ready to record, click Record 1 Seconds. Say the word “Go” loudly and clearly. 

After recording, you can use the play button to hear your recording. Listen and be sure that the sound is clear. If you are happy with your recorded sound, click Extract Sample. The new audio sample will show up in the Audio Samples column on the right. 

Repeat this step at least 8 times for the Go class.

Repeat these steps in the Stop class, saying the word “Stop” loudly and clearly.

When you have finished recording audio data, you should have at least twenty samples of background noise and at least eight samples for each of the other classes. 

Once you have collected enough data, click Train Model. Training the model will take a few minutes. It may take longer depending on your device and internet speed.

WARNING

Your browser may pop up a warning that the window is unresponsive – ignore this and allow the machine to keep working. The tab MUST stay open and visible while the model is training. 

You may need to close other tabs on your computer.

When your model is finished training, your screen will say Model Trained, and a Preview panel will appear on the right side of the screen. Now, you can test your model.

Say the words “Stop” and “Go”, and watch the way that your model reacts. It should recognize these sounds in the Preview panel. When you are not speaking, it should recognize these sounds as background noise. If it doesn’t, you may need to add more audio samples for each class and train again. 

When you are satisfied with the way your model works, click Export Model.

When the Export window pops up, leave the default settings as they are. Click the Upload my Model button.

Note: If you have already uploaded your model and are updating it, the button will read Update my Cloud Model.

After your model has uploaded, you will see a sharable link highlighted in blue. You will need this link in the following steps to create a Snap! project with your model and complete this activity. Copy the shareable link. After you copy the link, you can close the export window. (You can always open the window and access the link again later.)

You can save your model to reference or change it later by saving it to your computer or to your Google Drive. To do this, click on the Teachable Machine menu and select the option you would like.

Next Step

Using the Audio Classifier in Snap!

Open snap.birdbraintechnologies.com in a new tab. Connect your Finch Robot to Snap!. Click here for instructions on how to connect your Finch Robot to Snap!.

SNAP.BIRDBRAINTECHNOLOGIES.COM

 

Using the audio classifier in Snap! Is very similar to the process you used in Activity 1 for the image classifier.

If you are using snap.birdbraintechnologies.com, download and import this project into Snap!: 

Snap! Audio Recognition (Browser)

If you are using the BlueBird Connector instead of snap.birdbraintechnologies.com, open the project in Snap! with the following button and save a copy for yourself.

Snap! Audio Recognition

Click on the Settings menu and enable JavaScript extensions. You will need them for this project.

At the top of your Snap! window, there is an orange “set” block. Modify this script to set the URL variable to the link for your audio classifier that you copied from Google Teachable Machine.

Once your URL is pasted into the variable space in the Set block, click the block to run it. The blocks will appear to glow when they are clicked. You will only need to run this script once.

Watch the screen. If the model loaded correctly, you will see a message flash that reads “Model loaded successfully”. The URL will also update in the Snap! stage on the right. If you do not see this message, check that you pasted the URL correctly and try clicking the stack again.

Your program is ready to run your model! Press the spacebar to see your classifier make predictions in Snap!. It will take up to a minute for the classification to start the first time you run the script.

Now that you know the probabilities that correspond with each class, you can use that information to control the Finch’s actions. You can use the probabilities to make the Finch perform certain actions when each class is detected.

When the probability that the model recognizes the audio as “Go” is greater than 90%, the Finch will move forward at 20% speed. When the probability that the model recognizes the audio as “Stop” is greater than 90%, the Finch’s wheels will stop moving.

Next Step

Controlling the Finch Robot with your Audio Recognition Model in Snap!

Now that your model can predict what command it is hearing, we can use that data to control the Finch Robot’s actions. How will you have your Finch Robot respond to your different classes? Use the model that you just created as your beginning code – then modify and create!

Try these challenges:

  • Modify the program to add new commands for turning left and right. (You will need to retrain your model to add new classes.)
  • Modify the program to make the Finch move at different speeds according to the volume or speed of your voice. 
  • Modify the program to light up or make sounds as it moves.
  • Combine movement, sound, and lights to create a different reaction for each command.

Here are some tips to make editing your program quick and easy:

  • Right click and select Duplicate to make a copy of any of our sample code, and pick it up. Then you can edit that code!
  • To edit which item your code is responding to, highlight the item number, delete it, and type in your new number. Do this instead of attempting to select from the drop-down

As you test your program, notice what happens if the computer hears audio that is not similar to something in your training data.

BACK TO MAIN PAGE

Back to Top
  • Intro to AI and Machine Learning
  • Training your First Audio Recognition Model
  • Using the Audio Classifier in Snap!
  • Controlling the Finch Robot with your Audio Recognition Model in Snap!
  • Back to Main Page
BirdBrain Technologies
  • Contact Us
  • Join Our Mailing List
  • Support
NSF logo

National Science Foundation Research and development has been generously funded in part by STTR grants from the National Science Foundation.

Copyright ©2010-2026 BirdBrain Technologies. All rights reserved.

Manage Consent
To provide the best experiences, we use technologies like cookies to store and/or access device information. Consenting to these technologies will allow us to process data such as browsing behavior or unique IDs on this site. Not consenting or withdrawing consent, may adversely affect certain features and functions.
Functional Always active
The technical storage or access is strictly necessary for the legitimate purpose of enabling the use of a specific service explicitly requested by the subscriber or user, or for the sole purpose of carrying out the transmission of a communication over an electronic communications network.
Preferences
The technical storage or access is necessary for the legitimate purpose of storing preferences that are not requested by the subscriber or user.
Statistics
The technical storage or access that is used exclusively for statistical purposes. The technical storage or access that is used exclusively for anonymous statistical purposes. Without a subpoena, voluntary compliance on the part of your Internet Service Provider, or additional records from a third party, information stored or retrieved for this purpose alone cannot usually be used to identify you.
Marketing
The technical storage or access is required to create user profiles to send advertising, or to track the user on a website or across several websites for similar marketing purposes.
  • Manage options
  • Manage services
  • Manage {vendor_count} vendors
  • Read more about these purposes
View preferences
  • {title}
  • {title}
  • {title}
Manage Consent
To provide the best experiences, we use technologies like cookies to store and/or access device information. Consenting to these technologies will allow us to process data such as browsing behavior or unique IDs on this site. Not consenting or withdrawing consent, may adversely affect certain features and functions.
Functional Always active
The technical storage or access is strictly necessary for the legitimate purpose of enabling the use of a specific service explicitly requested by the subscriber or user, or for the sole purpose of carrying out the transmission of a communication over an electronic communications network.
Preferences
The technical storage or access is necessary for the legitimate purpose of storing preferences that are not requested by the subscriber or user.
Statistics
The technical storage or access that is used exclusively for statistical purposes. The technical storage or access that is used exclusively for anonymous statistical purposes. Without a subpoena, voluntary compliance on the part of your Internet Service Provider, or additional records from a third party, information stored or retrieved for this purpose alone cannot usually be used to identify you.
Marketing
The technical storage or access is required to create user profiles to send advertising, or to track the user on a website or across several websites for similar marketing purposes.
  • Manage options
  • Manage services
  • Manage {vendor_count} vendors
  • Read more about these purposes
View preferences
  • {title}
  • {title}
  • {title}