AI Sound

AI-native audio editor built as a modern replacement for Audacity, with LLM integration at its core. Features multi-track editing, AI transcription, speaker diarization, semantic search, and a full MCP server for external AI assistant integration.

Solo DeveloperNew Paradigm Web DevelopmentMarch 2026 - Present0 months

About This Project

AI Sound is designed from the ground up with AI integration — not bolted on as an afterthought. It exposes all editing capabilities through both a conversational AI assistant and an MCP server, enabling AI tools like Claude Desktop to edit, analyze, and export audio projects. Supports non-destructive multi-track editing, audio effects (normalize, compress, EQ, reverb, noise reduction), word-level transcription, speaker identification, and semantic content search.

Key Achievements

  • Architected an MCP server that exposes all audio editing operations as composable tools — AI assistants can import, trim, mix, transcribe, and export without understanding audio internals
  • Built AI-powered transcription pipeline with word-level timestamps and speaker diarization that integrates directly with the editing timeline
  • Implemented semantic search across transcribed audio content — find segments by meaning, not just keywords
  • Designed a non-destructive editing engine with full undo/redo history, markers, and automation envelopes
  • Audio effects chain (normalize, compress, EQ, reverb, noise reduction, fade, pitch shift, speed) all accessible via both UI and MCP tool calls
  • Cross-project track import enabling composable audio workflows between multiple projects
  • LLM-agnostic architecture — runs fully local with Ollama or connects to any OpenAI-compatible cloud provider

Related Projects

Related Publications

Articles that cover technologies and skills used in this project

Prop Drilling is a Code Smell When Used Incorrectly
[M]medium
Oct 10, 20244 min read

Prop Drilling is a Code Smell When Used Incorrectly

Prop drilling is considered an anti-pattern in React because it involves passing props down multiple levels of a component tree, even when only a few components need them. This can lead to several...

Read more ->
JavaScriptJavaScriptReactReact
Technical CommunicationTechnical Communication +1 more
AWS Lays off 14k Workers and Blames it on AI Automation, I call Bullshit.
[M]medium
Nov 6, 20256 min read

AWS Lays off 14k Workers and Blames it on AI Automation, I call Bullshit.

I believe the layoffs at AWS have less to do with AI automation directly and more to do with their failure to match services like OpenAI and Google in the race to provide AI platforms that just work...

Read more ->
AWSAWSOpenAIOpenAI
Technology EvaluationTechnology Evaluation +2 more
The Future of AI Isn’t a Chatbot: Where the Real Value Lies.
[M]medium
Oct 11, 20256 min read

The Future of AI Isn’t a Chatbot: Where the Real Value Lies.

ShitGTP - WAN 2.2 and Qwen ImageEvery tech CEO and C-suite executive right now is absolutely foaming at the mouth with lust for anything labeled “AI,” preferably with a sleek chat box where you can...

Read more ->
OpenAIOpenAIOllamaOllama
Technical LeadershipTechnical Leadership +5 more

Project Details

Timeline
March 2026 - Present
Duration
0 months
Role
Solo Developer

Technologies Used

TypeScriptTypeScriptReactReactNode.jsNode.jsSQLiteSQLiteOpenAIOpenAIOllamaOllama*MCP

Skills Demonstrated

Architecture PlanningArchitecture PlanningTechnology EvaluationTechnology EvaluationTechnical CommunicationTechnical Communication