Development notes (internal)¶

This document is for contributors. User-facing docs live in:

Layout¶

abstractvoice/vm/ — VoiceManager façade + mixins (TTS/STT/cloning orchestration)
abstractvoice/adapters/ — adapter implementations (Piper / AudioDiT / OmniVoice TTS; Faster-Whisper STT)
abstractvoice/audiodit/ — AudioDiT runtime + HF model implementation (vendored code; avoids trust_remote_code)
abstractvoice/omnivoice/ — OmniVoice runtime wrapper (offline-first + device/dtype policy glue)
abstractvoice/tts/ — audio playback utilities (NonBlockingAudioPlayer)
abstractvoice/cloning/ — optional cloning engines + voice store (f5_tts / chroma / audiodit / omnivoice)
abstractvoice/examples/ — REPL and demo entrypoints

Implementation points:

Piper downloads are gated in abstractvoice/adapters/tts_piper.py.
Faster-Whisper offline mode is enforced in abstractvoice/adapters/stt_faster_whisper.py.
Torch engine snapshots are resolved offline-first in their runtimes (abstractvoice/audiodit/runtime.py, abstractvoice/omnivoice/runtime.py).
Cloning downloads are explicit per engine (abstractvoice/cloning/engine_f5.py, abstractvoice/cloning/engine_chroma.py, abstractvoice/cloning/engine_audiodit.py, abstractvoice/cloning/engine_omnivoice.py).

abstractvoice/tts/tts_engine.py provides:

NonBlockingAudioPlayer (pause/resume/stop)
_SilenceStderrFD to suppress OS-level stderr spam that can corrupt terminal UI

The REPL avoids printing the prompt manually to prevent duplicate prompts (> >).

Cloned synthesis runs in a background thread in abstractvoice/vm/tts_mixin.py:

Cloning engines can be very large (especially Chroma). The REPL:

Core support:

python -m pytest -q

For CI/release runs, keep model-download and optional integration tests out of the default pass:

python -m pytest -q -m "not integration and not model_download"

AbstractVoice mirrors the AbstractCore release shape:

.github/workflows/ci.yml runs tests on Python 3.9-3.12 and verifies that source/wheel distributions build and pass twine check; it also smoke-builds the MkDocs site.
.github/workflows/release.yml runs the same test gate, validates that the requested tag matches abstractvoice/_version.py, extracts release notes from CHANGELOG.md, publishes to PyPI via trusted publishing, and creates a GitHub Release with the built distributions attached. Release runs also publish the MkDocs site to the gh-pages branch.

Release checklist:

Update abstractvoice/_version.py (__version__, the single version source).
Move CHANGELOG.md notes from [Unreleased] into a dated version section.
Push a tag like v0.9.1, or run the Release workflow manually with version=0.9.1.

The PyPI workflow expects a GitHub environment named pypi configured for trusted publishing on the abstractvoice project.