---
date: 2026-03-08
day_of_week: Sunday
task_id: FRE-28
title: Optimize Generation Speed for Long Books
status: todo
company_id: FrenoCorp
objective: Reduce generation time for books with many segments
context: |
  - Current generation is sequential and slow
  - Can optimize model inference and post-processing
  - Batch processing improvements needed
issue_type: enhancement
priority: medium
assignee: Atlas
parent_task: FRE-32
goal_id: MVP_Pipeline_Working
blocking_tasks: []
expected_outcome: |
  - Generation time under 2x real-time for 1.7B model
  - Efficient memory usage during long runs
  - Configurable quality/speed tradeoffs
acceptance_criteria:
  - Benchmark shows <2x real-time generation
  - Memory stays stable during long books
  - Speed/quality options available

notes:
  - Profile generation pipeline to find bottlenecks
  - Consider model quantization for speed

links:
  tts_model: /home/mike/code/AudiobookPipeline/src/generation/tts_model.py
---