Transcription Add-On

Automatically convert voicemail recordings to text for easier processing, searchability, and accessibility.

What It Does

Converts voicemail audio to text automatically
Provides searchable message content
Enables quick message review without playback
Supports multiple languages
Integrates with email notifications and CRM systems

When to Use

High volume of voicemails requiring quick review
Need searchable message archives
Accessibility requirements (hearing impaired staff)
Want to enable automated message routing/classification
Integration with text-based systems (CRM, ticketing)

Compatible Workflows

✅ Connie Voice Direct + Wait Experience: Transcribe voicemails from menu option
✅ Connie Voice Direct + Voicemail: Transcribe all recorded messages
⚠️ Connie Voice Direct: Only applicable if voicemail add-on also enabled

Technical Implementation

Transcription Services

Twilio Transcription (Recommended)

Built-in integration with Twilio recordings
Automatic processing
Multiple language support
Cost-effective for most use cases

Google Speech-to-Text

Higher accuracy for complex audio
Advanced language models
Real-time transcription capabilities
Custom vocabulary support

AWS Transcribe

Medical and legal vocabulary specialization
Speaker identification
Custom language models
Advanced punctuation and formatting

Basic Setup

// Enable transcription in Studio Flow
const recordingConfig = {
  transcribe: true,
  transcription_callback: `https://${context.DOMAIN_NAME}/transcription-webhook`,
  play_beep: true,
  max_length: 300
};

Webhook Handler

exports.handler = async (context, event, callback) => {
  const {
    TranscriptionText,
    TranscriptionStatus,
    RecordingUrl,
    CallSid,
    From
  } = event;
  
  if (TranscriptionStatus === 'completed') {
    // Process completed transcription
    await processTranscription({
      text: TranscriptionText,
      caller: From,
      recording_url: RecordingUrl,
      call_sid: CallSid
    });
  }
  
  callback(null, 'Transcription processed');
};

Advanced Features

Multi-Language Support

const detectLanguage = async (caller_number) => {
  // Auto-detect based on caller history or area code
  const callerProfile = await getCRMProfile(caller_number);
  return callerProfile?.preferred_language || 'en-US';
};

const transcriptionConfig = {
  language: await detectLanguage(event.From),
  enable_automatic_punctuation: true,
  enable_word_confidence: true
};

Quality Enhancement

// Pre-process audio for better transcription
const enhanceAudio = async (recording_url) => {
  return await processAudio(recording_url, {
    noise_reduction: true,
    volume_normalization: true,
    speech_enhancement: true
  });
};

// Post-process transcription text
const cleanTranscription = (text) => {
  return text
    .replace(/\b(um|uh|like)\b/gi, '') // Remove filler words
    .replace(/\s+/g, ' ') // Normalize whitespace
    .trim()
    .replace(/^./, str => str.toUpperCase()); // Capitalize first letter
};

Confidence Scoring

const processTranscriptionWithConfidence = (transcriptionData) => {
  const { text, confidence_scores } = transcriptionData;
  const avgConfidence = confidence_scores.reduce((a, b) => a + b) / confidence_scores.length;
  
  return {
    text: text,
    confidence: avgConfidence,
    quality: avgConfidence > 0.8 ? 'high' : avgConfidence > 0.6 ? 'medium' : 'low',
    requires_review: avgConfidence < 0.6
  };
};

Integration Features

Email Integration

// Include transcription in email notifications
const emailContent = `
  <h3>New Voicemail Message</h3>
  <p><strong>From:</strong> ${caller_number}</p>
  <p><strong>Duration:</strong> ${duration} seconds</p>
  
  <h4>Message Transcript:</h4>
  <div style="background: #f5f5f5; padding: 15px; font-family: monospace;">
    ${transcription_text}
  </div>
  
  <p><em>Confidence: ${confidence_score}%</em></p>
  <p><a href="${recording_url}">Listen to Original Recording</a></p>
`;

CRM Integration

// Create structured data for CRM
const createCRMEntry = async (transcriptionData) => {
  const keywords = extractKeywords(transcriptionData.text);
  const sentiment = analyzeSentiment(transcriptionData.text);
  
  return {
    caller_number: transcriptionData.caller,
    message_text: transcriptionData.text,
    keywords: keywords,
    sentiment: sentiment,
    confidence: transcriptionData.confidence,
    recording_url: transcriptionData.recording_url,
    created_at: new Date()
  };
};

Search and Analytics

// Make transcriptions searchable
const indexTranscription = async (transcriptionData) => {
  await searchIndex.add({
    id: transcriptionData.call_sid,
    content: transcriptionData.text,
    caller: transcriptionData.caller,
    timestamp: new Date(),
    tags: extractTags(transcriptionData.text)
  });
};

// Enable analytics
const analyzeContent = (text) => {
  return {
    word_count: text.split(' ').length,
    topics: extractTopics(text),
    entities: extractEntities(text),
    urgency_level: determineUrgency(text)
  };
};

Custom Vocabulary

Domain-Specific Terms

// Add organization-specific vocabulary
const customVocabulary = [
  { phrase: "Community Health Center", sounds_like: "community health center" },
  { phrase: "SNAP benefits", sounds_like: "snap benefits" },
  { phrase: "housing assistance", sounds_like: "housing assistance" },
  { phrase: "case manager", sounds_like: "case manager" }
];

const transcriptionConfig = {
  vocabulary_name: 'nonprofit_services',
  custom_vocabulary: customVocabulary
};

Medical/Legal Terms

// Specialized vocabulary for healthcare organizations
const medicalVocabulary = [
  "Medicaid", "Medicare", "prescription", "diagnosis", 
  "appointment", "referral", "insurance", "copay"
];

const legalVocabulary = [
  "eviction", "custody", "immigration", "asylum",
  "deportation", "legal aid", "court date"
];

Quality Assurance

Automatic Review Triggers

const needsReview = (transcription) => {
  const triggers = [
    transcription.confidence < 0.6,
    transcription.text.length < 10,
    transcription.text.includes('[inaudible]'),
    detectUrgentKeywords(transcription.text),
    transcription.duration > 180 // 3 minutes
  ];
  
  return triggers.some(trigger => trigger);
};

Human Review Workflow

if (needsReview(transcription)) {
  await createReviewTask({
    transcription_id: transcription.id,
    priority: transcription.urgency_level,
    assigned_to: 'transcription_review_team',
    due_date: addHours(new Date(), 4)
  });
}

Accuracy Monitoring

// Track transcription accuracy over time
const logAccuracy = async (transcription_id, human_corrected_text) => {
  const original = await getTranscription(transcription_id);
  const accuracy = calculateSimilarity(original.text, human_corrected_text);
  
  await logMetric('transcription_accuracy', accuracy, {
    language: original.language,
    duration: original.duration,
    confidence: original.confidence
  });
};

Privacy and Security

Data Handling

// Ensure PII protection in transcriptions
const sanitizeTranscription = (text) => {
  return text
    .replace(/\b\d{3}-\d{2}-\d{4}\b/g, '[SSN]') // Social Security Numbers
    .replace(/\b\d{16}\b/g, '[CARD]') // Credit card numbers
    .replace(/\b\d{3}-\d{3}-\d{4}\b/g, '[PHONE]') // Phone numbers (optional)
    .replace(/\b[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Z|a-z]{2,}\b/g, '[EMAIL]'); // Email addresses
};

Retention Policies

// Automatic cleanup of old transcriptions
const cleanupOldTranscriptions = async () => {
  const cutoffDate = new Date();
  cutoffDate.setMonth(cutoffDate.getMonth() - 6); // 6 months retention
  
  await deleteTranscriptions({
    created_before: cutoffDate,
    exclude_flagged: true // Keep flagged items longer
  });
};

Access Controls

// Role-based access to transcriptions
const canAccessTranscription = (user, transcription) => {
  const permissions = {
    admin: () => true,
    supervisor: () => transcription.department === user.department,
    agent: () => transcription.assigned_to === user.id,
    readonly: () => transcription.public === true
  };
  
  return permissions[user.role]?.() || false;
};

Performance Optimization

Batch Processing

// Process multiple transcriptions efficiently
const batchProcessTranscriptions = async (transcriptions) => {
  const batches = chunk(transcriptions, 10);
  
  for (const batch of batches) {
    await Promise.all(batch.map(processTranscription));
    await sleep(1000); // Rate limiting
  }
};

Caching Strategy

// Cache frequently accessed transcriptions
const getCachedTranscription = async (call_sid) => {
  const cacheKey = `transcription:${call_sid}`;
  let transcription = await cache.get(cacheKey);
  
  if (!transcription) {
    transcription = await database.getTranscription(call_sid);
    await cache.set(cacheKey, transcription, 3600); // 1 hour
  }
  
  return transcription;
};

Troubleshooting

Common Issues

Poor transcription quality

Check audio quality and format
Verify language settings
Consider background noise reduction
Update custom vocabulary

Missing transcriptions

Verify webhook configuration
Check transcription service status
Monitor callback delivery
Review error logs

High processing delays

Monitor service response times
Implement retry mechanisms
Consider alternative providers
Check network connectivity

Monitoring and Alerts

// Monitor transcription health
const monitorTranscription = async () => {
  const metrics = await getTranscriptionMetrics();
  
  if (metrics.success_rate < 0.95) {
    await sendAlert({
      type: 'transcription_failure',
      message: `Transcription success rate dropped to ${metrics.success_rate}%`,
      severity: 'high'
    });
  }
  
  if (metrics.avg_processing_time > 30) {
    await sendAlert({
      type: 'transcription_delay',
      message: `Average processing time: ${metrics.avg_processing_time}s`,
      severity: 'medium'
    });
  }
};

Best Practices

Audio Optimization

Use high-quality recording settings
Minimize background noise
Ensure proper microphone levels
Consider audio preprocessing

Language Configuration

Set appropriate language models
Use region-specific variants
Update vocabulary regularly
Test with actual caller accents

Integration Design

Design for transcription delays
Provide fallback options
Handle failures gracefully
Maintain original recordings

What It Does​

When to Use​

Compatible Workflows​

Technical Implementation​

Transcription Services​

Twilio Transcription (Recommended)​

Google Speech-to-Text​

AWS Transcribe​

Basic Setup​

Webhook Handler​

Advanced Features​

Multi-Language Support​

Quality Enhancement​

Confidence Scoring​

Integration Features​

Email Integration​

CRM Integration​

Search and Analytics​

Custom Vocabulary​

Domain-Specific Terms​

Medical/Legal Terms​

Quality Assurance​

Automatic Review Triggers​

Human Review Workflow​

Accuracy Monitoring​

Privacy and Security​

Data Handling​

Retention Policies​

Access Controls​

Performance Optimization​

Batch Processing​

Caching Strategy​

Troubleshooting​

Common Issues​

Monitoring and Alerts​

Best Practices​

Audio Optimization​

Language Configuration​

Integration Design​

What It Does

When to Use

Compatible Workflows

Technical Implementation

Transcription Services

Twilio Transcription (Recommended)

Google Speech-to-Text

AWS Transcribe

Basic Setup

Webhook Handler

Advanced Features

Multi-Language Support

Quality Enhancement

Confidence Scoring

Integration Features

Email Integration

CRM Integration

Search and Analytics

Custom Vocabulary

Domain-Specific Terms

Medical/Legal Terms

Quality Assurance

Automatic Review Triggers

Human Review Workflow

Accuracy Monitoring

Privacy and Security

Data Handling

Retention Policies

Access Controls

Performance Optimization

Batch Processing

Caching Strategy

Troubleshooting

Common Issues

Monitoring and Alerts

Best Practices

Audio Optimization

Language Configuration

Integration Design