AI Tools Directory

Audio AI tools

Audio tools range from cloud APIs and studio voice generators to consumer voice changers. Compare licensing terms for voice cloning, latency for real-time use, language coverage, and whether outputs are cleared for commercial broadcast.

13 listings · Last structured refresh: April 2026 (spot-check official sites before adoption).

Alibaba Cloud Intelligent Speech

Speech APIs for TTS and interaction services, metered through an Alibaba Cloud account.

Amazon Polly

AWS-managed speech synthesis for Lex, contact centers, and app backends.

Descript

Transcript-first audio editing, Studio Sound cleanup, and Overdub voice cloning (use with caution) for podcasts.

ElevenLabs

High-quality neural TTS, voice design, and dubbing APIs for apps and media.

Google Cloud Text-to-Speech

Google Cloud TTS and Chirp models for apps, IVR, and accessibility at scale.

iFlytek Open Platform

Open platform APIs for Mandarin ASR, neural TTS, wake words, and optional voice biometrics.

Listnr

TTS plus podcast publishing workflow for solo creators and agencies.

Murf

Timeline-based voiceover studio that syncs AI voices with slides and explainer content.

Natural Reader

Consumer and education TTS for reading documents, PDFs, and web pages aloud.

Play.ht

Embeddable AI audio players and TTS for turning articles into listenable experiences.

Resemble AI

API-first cloning, fill, and real-time speech with enterprise deployment options.

Tencent Cloud TTS

Speech APIs for TTS, telephony, and IoT workloads, billed through a Tencent Cloud account.

Voicemod

Low-latency voice filters and soundboard for live streaming and multiplayer chat.