|
1 | 1 | ### YamlMime:ModuleUnit |
2 | | -uid: learn.tensorflow.intro-audio-classification-tensorflow.visualizations-transforms |
| 2 | +uid: learn.tensorflow.intro-audio-classification.visualizations-transforms |
3 | 3 | title: Visualizing and transforming data |
4 | 4 | metadata: |
5 | 5 | title: Visualizing and transforming data |
6 | 6 | description: Visualizing and transforming data |
7 | 7 | author: Orin-Thomas |
8 | 8 | ms.author: orthomas |
9 | | - ms.date: 08/03/2021 |
| 9 | + ms.date: 04/20/2026 |
| 10 | + ms.update-cycle: 180-days |
10 | 11 | ms.topic: unit |
| 12 | + ms.collection: ce-advocates-ai-copilot |
11 | 13 | ms.custom: |
12 | 14 | - team=nextgen |
13 | 15 | - team=cloud_advocates |
14 | 16 | ms.product: learning-tensorflow |
15 | 17 | ms.contributors: |
16 | 18 | - cassieb-08202021 |
17 | 19 | durationInMinutes: 15 |
18 | | -sandbox: true |
19 | | -notebook: notebooks/3-visualizations-transforms.ipynb |
| 20 | +content: | |
| 21 | + [!include[](includes/3-visualizations-transforms.md)] |
20 | 22 | quiz: |
21 | 23 | title: Check your knowledge |
22 | 24 | questions: |
23 | 25 | - content: "When you resample the audio, you are..." |
24 | 26 | choices: |
25 | | - - content: "Increasing the size." |
26 | | - isCorrect: false |
27 | | - explanation: "Incorrect." |
28 | | - - content: "Reducing the size." |
| 27 | + - content: "Changing the number of samples used to represent each second of audio." |
29 | 28 | isCorrect: true |
30 | | - explanation: "Correct! We can reduce the size of the file by reducing the sample rate for the audio track." |
| 29 | + explanation: "Correct. Resampling changes the sample rate; it can downsample or upsample depending on the target rate." |
| 30 | + - content: "Always increasing the size." |
| 31 | + isCorrect: false |
| 32 | + explanation: "Incorrect. Resampling can increase or decrease the number of samples." |
| 33 | + - content: "Always reducing the size." |
| 34 | + isCorrect: false |
| 35 | + explanation: "Incorrect. Downsampling can reduce size, but resampling also includes upsampling." |
31 | 36 | - content: "What is a spectrogram?" |
32 | 37 | choices: |
33 | | - - content: "Maps the frequency to time of an audio file." |
| 38 | + - content: "A visualization of frequency content over time, usually with intensity or color showing magnitude." |
34 | 39 | isCorrect: true |
35 | | - explanation: "Correct!" |
36 | | - - content: "The audio channels." |
| 40 | + explanation: "Correct. A spectrogram shows how the strength of different frequencies changes over time." |
| 41 | + - content: "The number of audio channels." |
37 | 42 | isCorrect: false |
38 | | - explanation: "Incorrect." |
39 | | - - content: "Sample rate and frequency visualized." |
| 43 | + explanation: "Incorrect. Channels describe separate audio signals, such as left and right stereo channels." |
| 44 | + - content: "The amplitude of the audio signal over time." |
40 | 45 | isCorrect: false |
41 | | - explanation: "Incorrect this is a waveform." |
42 | | - - content: "Audio classification can only be done with computer vision on spectrograms." |
| 46 | + explanation: "Incorrect. That describes a waveform." |
| 47 | + - content: "Which input representation can be used for audio classification?" |
43 | 48 | choices: |
44 | | - - content: "True" |
45 | | - isCorrect: False |
46 | | - explanation: "Incorrect. There's more than one way to build audio classification models." |
47 | | - - content: "False" |
48 | | - isCorrect: True |
49 | | - explanation: "Correct! There's more than one way to build audio classification models." |
| 49 | + - content: "Waveforms, engineered audio features, or spectrogram tensors, depending on the model design." |
| 50 | + isCorrect: true |
| 51 | + explanation: "Correct. This module uses spectrograms, but audio classifiers can also learn from raw waveforms or other audio features." |
| 52 | + - content: "Only PNG images created from spectrograms." |
| 53 | + isCorrect: false |
| 54 | + explanation: "Incorrect. Saving spectrograms as images is optional and can add unnecessary file I/O or resizing artifacts." |
| 55 | + - content: "Only the number of audio channels." |
| 56 | + isCorrect: false |
| 57 | + explanation: "Incorrect. Channel count is useful metadata, but it doesn't represent the audio pattern to classify." |
0 commit comments