TEXT TO AUDIO CONVERSION WITH STYLE CONDITIONING

United States of America

APP PUB NO 20250054487A1
SERIAL NO 18233294

Abstract

A style encoder can be trained to encode audio style and audio characteristics into selected regions of a style vector. The style vector can be used to condition a text to speech (TTS) model to generate speech with human-understandable and controllable styles. Various training strategies of the style encoder are described, including a first, second and third training strategy that can be used to disentangle audio styles into selected regions of a style vector. The distinct regions of the style vector can be used to provide numerous customization options to a user of the described system, along with tools to generate speech with a speaker identity and using selected audio styles and characteristics.
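The idea of a style vector with distinct, separately controllable regions can be illustrated with a minimal sketch. Everything in it is a hypothetical assumption made for illustration: the region names (`speaker_identity`, `emotion`, `pace`), the vector size, and the helper functions `override_region` and `mix_styles` do not come from the application, and the trained style encoder and TTS model are not modeled at all. The sketch only shows how named regions of a vector could be edited or recombined before the vector is used for conditioning.

```python
# Hypothetical region layout: names and sizes are illustrative assumptions,
# not taken from the patent application.
REGIONS = {
    "speaker_identity": slice(0, 4),
    "emotion": slice(4, 6),
    "pace": slice(6, 8),
}
STYLE_DIM = 8


def override_region(style, region, values):
    """Return a copy of `style` with one named region replaced by `values`."""
    span = REGIONS[region]
    width = span.stop - span.start
    if len(values) != width:
        raise ValueError(f"{region} expects {width} values, got {len(values)}")
    out = list(style)  # copy so the caller's vector is left unchanged
    out[span] = values
    return out


def mix_styles(identity_source, style_source):
    """Keep the speaker-identity region of one vector and take
    every other region from a second vector."""
    out = list(style_source)
    span = REGIONS["speaker_identity"]
    out[span] = identity_source[span]
    return out
```

Under these assumptions, `mix_styles(voice_a, voice_b)` would yield a vector that carries the identity region of `voice_a` with the remaining regions of `voice_b`, and `override_region(vec, "emotion", [0.5, 0.7])` would change only the emotion region. In a real system the regions would be meaningful only if the encoder had been trained to disentangle styles into them, which is what the training strategies in the abstract address.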

Patent Owner(s)

Patent Owner  Address
NARO CORP     5668 PEMBROOKE CROSSING, WEST BLOOMFIELD, MI 48322

Inventor(s)

Inventor Name         Address      # of Filed Patents  Total Citations
Frenzel, Max Florian  Tokyo, JP    3                   1
Silverstein, Todd     Tokyo, JP    21                  1017
Stein, Lyle Patrick   Toronto, CA  3                   1
