Product Details

Amazon Polly is a service that turns text into lifelike speech, allowing you to create applications that talk and to build entirely new categories of speech-enabled products. Polly's Text-to-Speech (TTS) service uses deep learning technologies to synthesize natural-sounding human speech. With dozens of lifelike voices across a broad set of languages, you can build speech-enabled applications that work in many different countries. In addition to Standard TTS voices, Amazon Polly offers Neural Text-to-Speech (NTTS) voices that improve speech quality through a new machine learning approach. Polly’s Neural TTS technology also supports two speaking styles that let you match the delivery style of the speaker to the application: a Newscaster reading style tailored to news narration use cases, and a Conversational speaking style suited to two-way communication such as telephony applications. Finally, Amazon Polly Brand Voice can create a custom voice for your organization. This is a custom engagement in which you work with the Amazon Polly team to build an NTTS voice for the exclusive use of your organization.

Features of Amazon Polly

  • Natural-sounding voices: Amazon Polly provides dozens of languages and a wide selection of natural-sounding male and female voices. Amazon Polly's fluid pronunciation of text enables you to deliver high-quality voice output for a global audience.
  • Store & redistribute speech: Amazon Polly allows unlimited replays of generated speech without any additional fees. You can create speech files in standard formats like MP3 and OGG, and serve them from the cloud or locally with apps or devices for offline playback.
  • Real-time streaming: Delivering lifelike voices and conversational user experiences requires consistently fast response times. When you send text to Amazon Polly’s API, it returns the audio to your application as a stream so you can play the voices immediately.
  • Customize & control speech output: Modify Amazon Polly voices to best suit your needs. Amazon Polly supports lexicons and SSML tags, which let you control aspects of speech such as pronunciation, volume, pitch, and speaking rate.
  • Low cost: Amazon Polly’s pay-as-you-go pricing, low cost per character converted, and unlimited replays make it a cost-effective way to voice your applications.
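The streaming and SSML features listed above can be sketched in Python. This is a minimal illustration, not Amazon's reference code: the helper names (build_ssml, synthesize_request, save_speech) are hypothetical, the voice "Joanna" is only an example choice, and the last function assumes the boto3 SDK is installed and AWS credentials are configured. The pitch attribute is assumed to be used with a Standard voice.

```python
from xml.sax.saxutils import escape, quoteattr


def build_ssml(text, rate="medium", volume="medium", pitch="medium"):
    """Wrap plain text in SSML <speak>/<prosody> tags, escaping XML characters."""
    attrs = (
        f"rate={quoteattr(rate)} "
        f"volume={quoteattr(volume)} "
        f"pitch={quoteattr(pitch)}"
    )
    return f"<speak><prosody {attrs}>{escape(text)}</prosody></speak>"


def synthesize_request(text, voice_id="Joanna", output_format="mp3", **prosody):
    """Build keyword arguments for a SynthesizeSpeech call (no network access)."""
    return {
        "Text": build_ssml(text, **prosody),
        "TextType": "ssml",
        "VoiceId": voice_id,
        "OutputFormat": output_format,
    }


def save_speech(text, path, **kwargs):
    """Call Polly and write the streamed audio to a file for later replay.

    Assumption: boto3 is installed and AWS credentials are configured.
    """
    import boto3  # imported lazily so the helpers above work without it

    polly = boto3.client("polly")
    response = polly.synthesize_speech(**synthesize_request(text, **kwargs))
    # The audio arrives as a stream; here it is simply read in full and saved.
    with open(path, "wb") as audio_file:
        audio_file.write(response["AudioStream"].read())
```

A call such as save_speech("Hello from Polly", "hello.mp3", rate="slow") would store the audio locally, which matches the store-and-replay use case described in the list.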

Pricing

PAY-AS-YOU-GO MODEL
You are billed monthly for the number of characters of text that you process. Amazon Polly’s Standard voices are priced at $4.00 per 1 million characters for speech or Speech Marks requests (when outside the free tier). Amazon Polly’s Neural voices are priced at $16.00 per 1 million characters for speech or Speech Marks requests (when outside the free tier).

Free Tier
For Amazon Polly’s Standard voices, the free tier includes 5 million characters per month for speech or Speech Marks requests, for the first 12 months, starting from your first request for speech. For Neural voices, the free tier includes 1 million characters per month for speech or Speech Marks requests, for the first 12 months, starting from your first request for speech.
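As a rough illustration of the pricing described in this section, the following Python sketch estimates a monthly bill from a character count. The function and constant names are hypothetical. The rates and free-tier allowances are the ones quoted in this listing and may not reflect current pricing. The sketch assumes the free tier is applied separately to each voice type and does not model the 12-month limit.

```python
# Rates and allowances as quoted in this listing (may be out of date).
STANDARD_PRICE_PER_MILLION = 4.00   # USD per 1 million characters
NEURAL_PRICE_PER_MILLION = 16.00    # USD per 1 million characters
STANDARD_FREE_CHARS = 5_000_000     # free characters per month, Standard voices
NEURAL_FREE_CHARS = 1_000_000       # free characters per month, Neural voices


def monthly_cost(chars, neural=False, free_tier=True):
    """Estimate the monthly charge in USD for a number of processed characters."""
    price = NEURAL_PRICE_PER_MILLION if neural else STANDARD_PRICE_PER_MILLION
    free = 0
    if free_tier:
        free = NEURAL_FREE_CHARS if neural else STANDARD_FREE_CHARS
    # Only characters beyond the free allowance are billed.
    billable = max(0, chars - free)
    return billable * price / 1_000_000


print(monthly_cost(7_000_000))                # 8.0  (2M billable Standard chars)
print(monthly_cost(3_000_000, neural=True))   # 32.0 (2M billable Neural chars)
```

Under these assumptions, 7 million Standard characters in a free-tier month cost $8.00, because only the 2 million characters above the allowance are billed.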

25 Best Alternatives to Amazon Polly

Voicepoint

Voicepoint is a market-leading Swiss provider of digital dictation systems, speech recognition software and dictation management solutions. We help our customers in sectors heavily reliant on documentation (such as healthcare and the law) to optimise their administrative processes. Our solutions...
iSpeech

iSpeech provides human-quality text-to-speech and speech recognition solutions to consumers, developers and businesses worldwide.
  • Leading developer of speech-enabled mobile apps: 30+ million downloads of iSpeech apps
  • Leading speech development platform: 25,000+ developers and billions of API calls
  • Growing...
Replica Studios

Replica has developed an AI that can replicate the human voice, and has built text-to-speech software to produce expressive speech. Replica is growing a marketplace where creative talent and voice actors can scale and license their voices for use in games,...
VoxSciences

Listening to voice messages can be terribly inefficient and laborious. VoxSciences™ provides a paradigm shift by transcribing voice messages into text messages. This gives voice messages a quantum leap to join email, SMS and IM on an equal basis with...
Voice Report

Voice Report enables field employees to dictate reports while on the go, using a highly secure speech-to-text solution. Record your voice from any device and securely access your transcription online from anywhere. Dictate from anywhere at any time using your favorite...
Speechmatics

Speechmatics® powers applications that require mission-critical, accurate speech recognition using its any-context speech recognition engine. Speechmatics’ speech recognition technology is used by enterprises in scenarios such as contact centers, CRM, consumer electronics, security, media & entertainment and software. Speechmatics processes...
Crescendo Systems

Crescendo Systems Corporation is a leading developer of Documentation, Digital Dictation, Voice Processing, Transcription and Workflow Management systems for the medical, legal, law enforcement and insurance sectors. Established in Laval, Canada in 1990 with a solid focus on providing customer rich...
Phonexia

Phonexia transforms voice to knowledge with its innovative speech analytics and voice biometrics technologies. Its Phonexia Speech Engine is the first on the market using exclusively deep neural networks to provide extremely accurate and fast results. The Phonexia Speech Platform...
Sound Transcription

Sound Transcription serves media professionals, marketers, churches, and the education industry with automatic transcription of interviews, meetings, sermons, lectures, podcasts, webinars, and more. Transcription software for automated audio and video transcription, delivered to your inbox in minutes. Sound Transcription pricing starts...
LumenVox

LumenVox is a speech automation and multi-factor biometric authentication solutions company providing core speech technologies that include the LumenVox Speech Recognizer, Text-to-Speech Engine, Call Progress Analysis, Speech Tuner, Natural language solutions support and Multifactor Biometric Authentication. We have won numerous...
SpeechWrite

SpeechWrite Digital is a full solution provider specialising in workflow solutions, digital dictation, voice recognition and PDF solutions. Our practical technology, sophisticated yet simple, allows you to enhance your working environment and simply work smarter....
Adobe Podcast

Adobe Podcast is an AI-powered podcasting platform that helps creators produce high-quality podcasts quickly and easily. It offers a variety of features to help creators. Adobe Podcast's AI-powered audio transcription, enhancement, and editing features can help creators save time and effort, and to...
Voicemod

Voicemod is an AI-powered voice modulation and sound effects platform that allows users to change their voice in real time. It offers a variety of features to help users. Voicemod's integration with popular applications makes it easy for users to...
Krisp

Krisp is an AI-powered noise cancellation and background removal tool that helps users create a professional-sounding audio environment. It offers a variety of features to help users. Krisp uses state-of-the-art AI technology to cancel out background noise with high accuracy. This...
Maverick

Maverick is an AI-powered audio editing and mastering platform that helps users create professional-sounding audio recordings. It offers a variety of features to help users. Maverick's comprehensive audio publishing tools make it easy for users to publish their audio recordings...
Beatoven.ai

Beatoven.ai is an AI-powered music composition platform that helps users create royalty-free music for their projects. It offers a variety of features to help users. Beatoven.ai's royalty-free music feature makes it easy for users to use their music in their...
Cleanvoice AI

Cleanvoice AI is an AI-powered audio editing platform that helps users remove filler words, stuttering, and mouth sounds from their audio recordings. It is a popular tool for podcasters, YouTubers, and other creators who want to improve the quality of...
MusicLM

MusicLM is an experimental AI model from Google that can generate music from text descriptions, such as "a calming violin melody backed by a distorted guitar riff". It treats music generation as a hierarchical sequence-to-sequence modeling task, which means that...
Adobe Enhance Speech

Adobe Enhance Speech is an AI-powered audio enhancement tool that helps users improve the quality of their recorded speech. It uses AI to reduce noise, echo, and other artifacts from speech recordings. It can also improve the clarity and intelligibility...
Audyo

Audyo is an AI-powered audio editing and creation platform that helps users create professional-sounding audio content, such as podcasts and audiobooks, in a matter of minutes. It offers a variety of features to help users. Audyo provides users...
Descript

Descript is an all-in-one audio and video editing platform that helps users create professional-sounding and professional-looking content, even if they have no prior experience with video editing. It offers a variety of features to help users. Descript's editing tools...
AudioStrip

AudioStrip is a relatively new AI-powered audio editing platform that helps users create professional-sounding audio content without the need for expensive equipment or software. AudioStrip provides users with a variety of publishing tools, such as the ability to export audio...
Altered

Altered is an AI-powered music creation platform that helps users create professional-sounding music without the need for any prior musical experience. Altered's AI-powered music generation feature makes it easy for users to create professional-sounding music, even if they have no prior...
Podcastle

Podcastle is an all-in-one AI-powered audio and video creation platform. It enables podcasters, creators, interviewers, marketers and others to record, edit, enhance, transcribe, and export their content with unmatched simplicity. Podcastle offers a variety of features to help users create professional-sounding...
Listen2 AI

Listen2.AI is an innovative mobile application that transforms how users consume news. Powered by cutting-edge artificial intelligence, Listen2.AI delivers personalized, unbiased news in easy-to-digest audio clips, making staying informed quick and convenient. Whether commuting, working out, or multitasking, Listen2.AI offers...
