Ref: https://learn.cantrill.io/courses/1820301/lectures/42176908
Amazon Transcribe - Key Concepts
- đź”§Â Automatic Speech Recognition (ASR) service
- Audio (input) → Text (output)
- Pay-per-use: billed per second of transcribed audio
- 💡 ASR is a DL process
- Features:
- Automatic language identification for multi-lingual audio
- Language customization
- Filters for privacy
- e.g. filter out PII (identify and redact PII)
- Audience-appropriate language
- Speaker identification
- Improve accuracy for domain-specific, non-standard terms with:
- Custom vocabularies (words)
- Custom language models (context)
- Provide domain-specific text for Transcribe to learn context of the domain-specific words
- Use cases
- Full text indexing of audio → allows searching
- Meeting notes
- Subtitles/captions and transcripts
- Amazon Transcribe Call Analytics → Phone call analytics
- characteristics, summarization, categories, sentiment…
- Amazon Transcribe Medical
Amazon Transcribe - Toxicity Detection
Ref: https://www.udemy.com/course/aws-ai-practitioner-certified/learn/lecture/44887629
- Transcribe can detect and score voice-based toxicity
- Leverages both speech cues (tone, pitch) & text cues
- Toxicity categories: sexual harassment, hate speech, threat, abuse, profanity, insult, graphic…
Amazon Transcribe Medical
Ref: https://www.udemy.com/course/aws-ai-practitioner-certified/learn/lecture/44887711
- đź”§Â Automatically convert medical-related speech to text
- âť—HIPAA (Health Insurance Portability and Accountability Act) compliant
- 💡 Enhancement of Amazon Transcribe, sub-product
- Ability to transcribe medical terminologies such as:
- Medicine names
- Procedures
- Conditions and diseases
- Supports both real-time (microphone) and batch (upload files) transcriptions