音声合成 (Japanese Wikipedia)

Analysis of information sources in references of the Wikipedia article "音声合成" in Japanese language version.

refsWebsite

Global rank Japanese rank

10arxiv.org

69^th place

227^th place

6worldcat.org

5^th place

19^th place

4doi.org

2^nd place

6^th place

2itmedia.co.jp

624^th place

41^st place

2nii.ac.jp

304^th place

20^th place

2jst.go.jp

1,903^rd place

237^th place

2stanford.edu

179^th place

608^th place

1gnu.org

1,475^th place

1,684^th place

1chunichi.co.jp

825^th place

48^th place

1hut.fi

low place

1nytimes.com

7^th place

63^rd place

1bell-labs.com

5,739^th place

5,877^th place

1robohon.com

low place

1scitation.org

4,352^nd place

low place

1semanticscholar.org

11^th place

1,537^th place

1ieee.org

652^nd place

1,307^th place

1google-research.github.io

low place

1marktechpost.com

low place

1synsig.org

low place

1mindspring.com

low place

1golem.de

3,168^th place

low place

1wikipedia.org

low place

1townnews.co.jp

4,908^th place

293^rd place

1cnet.com

272^nd place

304^th place

1casio.jp

low place

7,718^th place

1healsio.jp

low place

1asahi.com

141^st place

9^th place

1apple.com

67^th place

147^th place

1deepmind.com

low place

1ai-j.jp

low place

1sharp.co.jp

low place

2,765^th place

1amazon.com

105^th place

738^th place

1toyota.jp

low place

1,529^th place

1robotstart.info

low place

6,637^th place

1voicetext.jp

low place

1trafficnews.jp

3,932^nd place

256^th place

1amazon.co.jp

554^th place

50^th place

1nuance.com

low place

1nikkan-gendai.com

1,691^st place

108^th place

ai-j.jp

“5/30サービス開始！NTTドコモの新しいAIエージェント「my daiz」にエーアイの音声合成AITalkが採用株式会社AI（エーアイ）”. 株式会社エーアイ(AI). 2018年11月28日閲覧。

amazon.co.jp

“Amazon.co.jp ヘルプ: 読み上げ機能を使用する”. www.amazon.co.jp. 2018年11月28日閲覧。

amazon.com

developer.amazon.com

“Amazon PollyでAlexaの音声をカスタマイズしよう” (英語) 2018年11月28日閲覧。

apple.com

machinelearning.apple.com

“Deep Learning for Siri’s Voice: On-device Deep Mixture Density Networks for Hybrid Unit Selection Synthesis - Apple” (英語). Apple Machine Learning Journal. 2018年11月28日閲覧。

arxiv.org

van den Oord, Aaron; Dieleman, Sander; Zen, Heiga; Simonyan, Karen; Vinyals, Oriol; Graves, Alex; Kalchbrenner, Nal; Senior, Andrew et al. (2016-09-12). “WaveNet: A Generative Model for Raw Audio” (English). arXiv. arXiv:1609.03499.
Arik, Sercan O.; Chrzanowski, Mike; Coates, Adam; Diamos, Gregory; Gibiansky, Andrew; Kang, Yongguo; Li, Xian; Miller, John et al. (2017-02-25). “Deep Voice: Real-time Neural Text-to-Speech” (English). arXiv. arXiv:1702.07825.
Wang, Yuxuan; Skerry-Ryan, RJ; Stanton, Daisy; Wu, Yonghui; Weiss, Ron J.; Jaitly, Navdeep; Yang, Zongheng; Xiao, Ying et al. (2017-03-29). “Tacotron: Towards End-to-End Speech Synthesis” (English). arXiv. arXiv:1703.10135.
We use the feed-forward Transformer block, …, as the basic structure for the encoder and mel-spectrogram decoder. arxiv
Jaime (2018) TOWARDS ACHIEVING ROBUST UNIVERSAL NEURAL VOCODING https://arxiv.org/abs/1811.06292
Naihan Li, et al. Neural Speech Synthesis with Transformer Network
Better speech synthesis through scaling James Betker 2023年5月23日
XTTS: a Massively Multilingual Zero-Shot Text-to-Speech Model Edresson Casanova et al. 2024年6月7日
Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers p.5 Chengyi Wang, et al. 2023年
Speak, Read and Prompt: High-Fidelity Text-to-Speech with Minimal Supervision Eugene Kharitonov, et al. 2023年

asahi.com

“音声ニュース配信　朝日新聞アルキキ”. www.asahi.com. 2018年11月28日閲覧。

bell-labs.com

Bell Labs: Where "HAL" First Spoke (Bell Labs Speech Synthesis website)

casio.jp

arch.casio.jp

“エクスワードに搭載された快適機能 - 電子辞書 - CASIO”. arch.casio.jp. 2018年11月28日閲覧。

chunichi.co.jp

【Hope】失った私の声で会話を／AI学習そっくり再現：ベンチャー無償提供がん患者らに希望『東京新聞』夕刊2022年8月20日1面（2022年8月27日閲覧）

cnet.com

japan.cnet.com

“阪急電鉄、訪日外国人向け多言語アナウンスサービスを導入--案内情報の印刷も” (日本語). CNET Japan. (2018年5月24日) 2018年11月28日閲覧。

deepmind.com

“WaveNet launches in the Google Assistant | DeepMind”. DeepMind. 2018年11月28日閲覧。

doi.org

徳田, 恵一 (2017). “風雲急を告げる音声合成研究の最新動向”. 情報・システムソサイエティ誌 (電子情報通信学会) 21 (4): 10–11. doi:10.1587/ieiceissjournal.21.4_10. ISSN 2189-9797. NAID 130005312792.
Andrew J., Hunt; Black, Alan W. (1996). “Unit selection in a concatenative speech synthesis system using a large speech database” (English). 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings (IEEE): 373–376. doi:10.1109/ICASSP.1996.541110. ISBN 0-7803-3192-3. ISSN 1520-6149.
Masuko, Takashi; Keiichi, Tokuda; Takao, Kobayashi; Satoshi, Imai (1999-05-09). “Speech synthesis using HMMs with dynamic features” (English). 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings (IEEE): 389–392. doi:10.1109/ICASSP.1996.541114. ISBN 0-7803-3192-3. ISSN 1520-6149.
Gopala K. Anumanchipalli, et al.. (2019) Speech synthesis from neural decoding of spoken sentences [paper]

gnu.org

savannah.gnu.org

Articulatory Speech Synthesis - Summary [Savannah]

golem.de

KI-Sprachforschungsteam von Mozilla macht allein weiter （ドイツ語） Golem.de（ドイツ語版） 2021年3月15日

google-research.github.io

Speak, Read and Prompt: High-Fidelity Text-to-Speech with Minimal Supervision Google Research

healsio.jp

“音声対話”. AX-XW400 | ウォーターオーブンヘルシオ：シャープ. 2018年11月28日閲覧。

hut.fi

acoustics.hut.fi

History and Development of Speech Synthesis (Helsinki University of Technology) - 英語

ieee.org

ieeexplore.ieee.org

"Statistical parametric speech synthesis ... as a framework to generate a synthetic speech signal based on a statistical model" Tachibana, et al. (2018). An Investigation of Noise Shaping with Perceptual Weighting for Wavenet-Based Speech Generation. doi: 10.1109/ICASSP.2018.8461332

itmedia.co.jp

「“AIアナウンサー”がラジオ放送　Amazonの音声合成技術で」『ITmedia NEWS』。2018年11月28日閲覧。
「NHKが「人造アナウンサー」開発、コップのフチにいそうな「ニュースのヨミ子」さん」『ITmedia NEWS』。2018年11月28日閲覧。

jst.go.jp

jstage.jst.go.jp

"規則合成は ... 三つの処理に分けることができる ... 第三は韻律情報により規定された音源波形で，パラメータ表現された声道伝達フィルタを駆動して合成波形を生成する処理 ... 音声合成方式は，波形編集方式，分析合成方式，ホルマント合成方式などが規則合成に用いられており" 広川. (1993). 規則合成における音声合成単位及び音声合成法 - より高品質を求めて. 日本音響学会誌 49巻, 12号. pp. 847-853.
"分析合成方式は音声生成過程を音源モデルと声道モデルに分け，それぞれの分析パラメータを独立に制御することにより規則合成音を得る方法である。 " 広川. (1993). 規則合成における音声合成単位及び音声合成法 - より高品質を求めて. 日本音響学会誌 49巻, 12号. pp. 847-853.

marktechpost.com

OuteTTS-0.1-350M Released: A Novel Text-to-Speech (TTS) Synthesis Model that Leverages Pure Language Modeling without External Adapters Marktechpost Media 2024年11月4日

mindspring.com

Smithsonian Speech Synthesis History Project (SSSHP) 1986-2002

nii.ac.jp

ci.nii.ac.jp

徳田, 恵一 (2017). “風雲急を告げる音声合成研究の最新動向”. 情報・システムソサイエティ誌 (電子情報通信学会) 21 (4): 10–11. doi:10.1587/ieiceissjournal.21.4_10. ISSN 2189-9797. NAID 130005312792.
河井, 恒; 戸田, 智基; 山岸, 順一; 平井, 俊男; 倪, 晋富; 西澤, 信行; 津崎, 実; 徳田, 恵一 (2006). “大規模コーパスを用いた音声合成システムXIMERA”. 電子情報通信学会論文誌 J89-D (12): 2688–2698. ISSN 18804535. NAID 110007380404.

nikkan-gendai.com

“受け入れ態勢は？「筆談ホステス」当選の北区議会に聞いた”. 日刊ゲンダイDIGITAL. 2018年11月28日閲覧。

nuance.com

whatsnext.nuance.com

“Remembering Stephen Hawking’s iconic synthesized voice” (英語). What’s next. (2018年3月19日) 2018年11月28日閲覧。

nytimes.com

query.nytimes.com

http://query.nytimes.com/search/query?ppds=per&v1=GERSTMAN%2C%20LOUIS&sort=newest Louis Gerstmanの死亡記事（NYタイムス）

robohon.com

“ロボホン”. robohon.com. 2018年11月28日閲覧。

robotstart.info

「テレビの歴史で初となる、全キャラクターが音声合成でしゃべるアニメがスタート | ロボスタ - ロボット情報WEBマガジン」『ロボスタ』。2018年11月28日閲覧。

scitation.org

asa.scitation.org

"Formant synthesis versus articulatory synthesis" Klatt. (1979). Software for a cascade/parallel formant synthesizer. J. Acoust. Soc. Am. 67(3).

semanticscholar.org

"Unit selection synthesis is also referred as corpus based synthesis." Kayte. (2015). A Review of Unit Selection Speech Synthesis. IJARCSSE.

sharp.co.jp

“エモパー｜機能・サービス｜AQUOS ZETA SH-01G｜製品ラインアップ｜AQUOS：シャープ”. シャープスマートフォン・携帯電話　AQUOS公式サイト. 2018年11月28日閲覧。

stanford.edu

ccrma.stanford.edu

"A formant synthesizer is a source-filter model in which the source models the glottal pulse train and the filter models the formant resonances of the vocal tract." Smith. (2010). Formant Synthesis Models. Physical Audio Signal Processing. ISBN 978-0-9745607-2-4
"Constrained linear prediction can be used to estimate the parameters ... more generally ... directly from the short-time spectrum" Smith. (2010). Formant Synthesis Models. Physical Audio Signal Processing. ISBN 978-0-9745607-2-4

synsig.org

“Blizzard Challenge 2018 - SynSIG” (英語). www.synsig.org. 2018年11月30日閲覧。

townnews.co.jp

「防災無線が機械音声に 11月１日から本格開始 | 厚木 | タウンニュース」『タウンニュース』2016年11月11日。2018年11月28日閲覧。

toyota.jp

CORPORATION., TOYOTA MOTOR. “トヨタ KIROBO mini | KIBO ROBOT PROJECT | KIROBO・MIRATA | トヨタ自動車WEBサイト”. トヨタ KIROBO mini | KIBO ROBOT PROJECT | KIROBO・MIRATA | トヨタ自動車WEBサイト. 2018年11月28日閲覧。

trafficnews.jp

「ハイウェイラジオのヒミツ　情報の早さ、エリアの細かさ、その仕組みは？ | 乗りものニュース」『乗りものニュース』。2018年11月28日閲覧。

voicetext.jp

“VoiceTextホーム | HOYA音声合成ソフトウェア”. HOYA音声合成ソフトウェア「VoiceText」. 2018年11月28日閲覧。

wikipedia.org

de.wikipedia.org

KI-Sprachforschungsteam von Mozilla macht allein weiter （ドイツ語） Golem.de（ドイツ語版） 2021年3月15日

worldcat.org

search.worldcat.org

徳田, 恵一 (2015). “統計的音声合成技術の現在・過去・未来”. 音声言語シンポジウム IEICE-115 (346). ISSN 0913-5685.
徳田, 恵一 (2017). “風雲急を告げる音声合成研究の最新動向”. 情報・システムソサイエティ誌 (電子情報通信学会) 21 (4): 10–11. doi:10.1587/ieiceissjournal.21.4_10. ISSN 2189-9797. NAID 130005312792.
Andrew J., Hunt; Black, Alan W. (1996). “Unit selection in a concatenative speech synthesis system using a large speech database” (English). 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings (IEEE): 373–376. doi:10.1109/ICASSP.1996.541110. ISBN 0-7803-3192-3. ISSN 1520-6149.
河井, 恒; 戸田, 智基; 山岸, 順一; 平井, 俊男; 倪, 晋富; 西澤, 信行; 津崎, 実; 徳田, 恵一 (2006). “大規模コーパスを用いた音声合成システムXIMERA”. 電子情報通信学会論文誌 J89-D (12): 2688–2698. ISSN 18804535. NAID 110007380404.
Masuko, Takashi; Keiichi, Tokuda; Takao, Kobayashi; Satoshi, Imai (1999-05-09). “Speech synthesis using HMMs with dynamic features” (English). 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings (IEEE): 389–392. doi:10.1109/ICASSP.1996.541114. ISBN 0-7803-3192-3. ISSN 1520-6149.
Zen, Heiga; Senior, Andrew; Schuster, Mike (2013-05-26). “Statistical parametric speech synthesis using deep neural networks” (English). 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (IEEE): 7962–7966. ISBN 978-1-4799-0356-6. ISSN 1520-6149.