Amazon Polly

Product Details

Amazon Polly is a service that turns text into lifelike speech, allowing you to create applications that talk, and build entirely new categories of speech-enabled products. Polly's Text-to-Speech (TTS) service uses advanced deep learning technologies to synthesize natural sounding human speech. With dozens of lifelike voices across a broad set of languages, you can build speech-enabled applications that work in many different countries. In addition to Standard TTS voices, Amazon Polly offers Neural Text-to-Speech (NTTS) voices that deliver advanced improvements in speech quality through a new machine learning approach. Polly’s Neural TTS technology also supports two speaking styles that allow you to better match the delivery style of the speaker to the application: a Newscaster reading style that is tailored to news narration use cases, and a Conversational speaking style that is ideal for two-way communication like telephony applications. Finally, Amazon Polly Brand Voice can create a custom voice for your organization. This is a custom engagement where you will work with the Amazon Polly team to build an NTTS voice for the exclusive use of your organization.

Features of Amazon Polly

Natural sounding voices: Amazon Polly provides dozens of languages and a wide selection of natural-sounding male and female voices. Amazon Polly's fluid pronunciation of text enables you to deliver high-quality voice output for a global audience.
Store & redistribute speech: Amazon Polly allows for unlimited replays of generated speech without any additional fees. You can create speech files in standard formats like MP3 and OGG, and serve them from the cloud or locally with apps or devices for offline playback.
Real-time streaming: Delivering lifelike voices and conversational user experiences requires consistently fast response times. When you send text to Amazon Polly’s API, it returns the audio to your application as a stream so you can play the voices immediately.
Customize & control speech output: Modify Amazon Polly voices to best suit your needs. Amazon Polly supports lexicons and SSML tags, which let you control aspects of speech such as pronunciation, volume, pitch, and speaking rate.
Low cost: Amazon Polly’s pay-as-you-go pricing, low cost per character converted, and unlimited replays make it a cost-effective way to voice your applications.
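
As a rough illustration of the SSML control mentioned in the feature list, the sketch below wraps plain text in `<speak>` and `<prosody>` tags using only the Python standard library. The helper name `build_ssml` is hypothetical, and which prosody attributes a given voice honours is an assumption to check against the Amazon Polly documentation.

```python
from xml.sax.saxutils import escape, quoteattr


def build_ssml(text, **prosody):
    """Wrap plain text in SSML, optionally inside a <prosody> tag.

    `prosody` holds attribute names such as rate, volume, or pitch.
    This is an illustrative helper, not part of any Amazon SDK.
    """
    # Quote each attribute value so it is safe inside the tag.
    attrs = "".join(f" {name}={quoteattr(value)}" for name, value in prosody.items())
    # Escape &, <, and > so the text cannot break the markup.
    body = escape(text)
    if attrs:
        body = f"<prosody{attrs}>{body}</prosody>"
    return f"<speak>{body}</speak>"


print(build_ssml("Tom & Jerry start now", rate="slow", volume="loud"))
```

With the AWS SDK for Python, a string like this would typically be sent through the Polly client's `synthesize_speech` call with `TextType="ssml"`; that step needs AWS credentials and is left out of the sketch.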

Pricing

PAY-AS-YOU-GO MODEL

You are billed monthly for the number of characters of text that you process. Amazon Polly’s Standard voices are priced at $4.00 per 1 million characters for speech or Speech Marks requests (when outside the free tier). Amazon Polly’s Neural voices are priced at $16.00 per 1 million characters for speech or Speech Marks requests (when outside the free tier).

Free Tier

MILLIONS OF CHARACTERS PER MONTH

For Amazon Polly’s Standard voices, the free tier includes 5 million characters per month for speech or Speech Marks requests, for the first 12 months, starting from your first request for speech. For Neural voices, the free tier includes 1 million characters per month for speech or Speech Marks requests, for the first 12 months, starting from your first request for speech.
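
To make the pricing arithmetic concrete, here is a minimal sketch that estimates a monthly bill from the rates and free-tier allowances quoted above. The function name `monthly_cost` is hypothetical, and the sketch assumes charges scale linearly per character with no rounding or minimum charge, which the listing does not confirm.

```python
from decimal import Decimal

# Rates and allowances as quoted in the pricing text above.
RATE_PER_MILLION = {"standard": Decimal("4.00"), "neural": Decimal("16.00")}
FREE_CHARS_PER_MONTH = {"standard": 5_000_000, "neural": 1_000_000}


def monthly_cost(chars, voice, in_free_tier=False):
    """Estimate the monthly charge in USD for one voice type.

    Assumes linear per-character billing (an assumption, not confirmed
    by the listing). Set in_free_tier=True during the first 12 months.
    """
    free = FREE_CHARS_PER_MONTH[voice] if in_free_tier else 0
    billable = max(0, chars - free)
    return RATE_PER_MILLION[voice] * billable / Decimal(1_000_000)


# 8 million Standard characters in a free-tier month: 3 million are billable.
print(f"${monthly_cost(8_000_000, 'standard', in_free_tier=True):.2f}")
# 3 million Neural characters after the free tier has ended.
print(f"${monthly_cost(3_000_000, 'neural'):.2f}")
```

Under these assumptions the first case bills 3 million characters at $4.00 per million, and the second bills all 3 million at $16.00 per million.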

25 Best Alternatives of Amazon Polly

Voicepoint

Voicepoint is a market-leading Swiss provider of digital dictation systems, speech recognition software and dictation management solutions. We help our customers in sectors heavily reliant on documentation (such as healthcare and the law) to optimise their administrative processes. Our solutions...

iSpeech

iSpeech provides human-quality text to speech and speech recognition solutions to consumers, developers and businesses worldwide. Leading developer of speech-enabled mobile apps: 30+ million downloads of iSpeech apps. Leading speech development platform: 25,000+ developers and billions of API calls. Growing...

Replica Studios

Replica has developed an AI that can replicate the human voice, and has built text-to-speech software to produce expressive speech. Replica is growing a marketplace where creative talent and voice actors can scale and license their voices for use in games,...

VoxSciences

Listening to voice messages can be terribly inefficient and laborious. VoxSciences™ provides a paradigm shift by transcribing voice messages into text messages. This gives voice messages a quantum leap to join email, SMS and IM on an equal basis with...

Voice Report

Voice Report enables field employees to dictate reports while on the go, using a highly secure speech-to-text solution. Record your voice from any device and securely access your transcription online from anywhere. Dictate from anywhere at any time using your favorite...

Speechmatics

Speechmatics® powers applications that require mission-critical, accurate speech recognition using its any-context speech recognition engine. Speechmatics’ speech recognition technology is used by enterprises in scenarios such as contact centers, CRM, consumer electronics, security, media & entertainment and software. Speechmatics processes...

Crescendo Systems

Phonexia

Sound Transcription

LumenVox

SpeechWrite

Adobe Podcast

Voicemod

Krisp

Maverick

Beatoven.ai

Cleanvoice AI

MusicLM

Adobe Enhance Speech

Audyo

Descript

AudioStrip

Altered

Podcastle

Listen2 AI
