Wen-Yi (Wayne) Hsiao 蕭文逸
Emails: wenyi.hsiao@bytedance.com / s101062219@gmail.com

𝄆 CV | Google Scholar | Github | LinkedIn | Twitter 𝄇

蕭文逸(Wen-Yi Hsiao)


Hi, I am Wen-Yi Hsiao (蕭文逸).
I am a Research Scientist at ByteDance, Seed Team@San Jose, USA. 🇺🇸

I’m a researcher (1K+ citations) and programmer (1K+ GitHub stars), working to bridge the gap between theory and practice.
I work with teams to do research, create products, and organize exhibitions.
Meanwhile, I study and perform music on my own.
I love to work with people from diverse fields, where I believe true sparks happen.
I am also a piano lover 🐈.


  Education & Experience

sym

NTHU
國立清華大學
Computer Science (BS)
2012.09 - 2016.07

sym

NTHU
國立清華大學
Computer Science (MS)
2016.09 - 2018.07

sym

Academia Sinica
中央研究院
Research Assistant
2017.09 - 2018.11

sym

Taiwan AILabs
台灣人工智慧實驗室
Research Engineer
2019.04 - 2024.02

sym

Taiwan AILabs
台灣人工智慧實驗室
Sr. Research Engineer
2024.02- 2024.12

sym

ByteDance
字節跳動
Research Scientist
2025.02 - Present



Conference Proceedings

2024 ISMIR: “Musicongen: Rhythm and chord control for transformer-based text-to-music generation.”
Txt2music - Arxiv | Demo | GitHub (2nd Author)
2024 DAFx: “Hyper recurrent neural network: Condition mechanisms for black-box audio effect modeling.”
DSP / AFx - Arxiv | GitHub (2nd Author)
2022 ICASSP: “Towards automatic transcription of polyphonic electric guitar music”
Analysis - Arxiv | Dataset (2nd Author)
2022 ISMIR: “DDSP-based singing vocoders: A new subtractive-based synthesizer and a comprehensive evaluation.”
Vocoder - Arxiv | Demo | GitHub (co-1st Author)
2021 EUSIPCO: “Source separation-based data augmentation for improved joint beat and downbeat tracking.”
Analysis - Arxiv | GitHub (3rd Author)
2021 AAAI: “Compound word transformer: Learning to compose full-song music over dynamic directed hypergraphs.”
AI Music - Arxiv | GitHub | Slides (1st Author)
2020 ISMIR: “Automatic composition of guitar tabs by transformers and groove modeling.”
AI Music - Arxiv | Demo | Slides (3rd Author)
2020 MMSP: “Mixing-specific data augmentation techniques for improved blind violin/piano source separation.”
Separation - Arxiv | GitHub (2nd Author)
2018 AAAI: “Musegan: Multi-track sequential generative adversarial networks for symbolic music generation and accompaniment.”
AI Music - Arxiv | GitHub (co-1st Author)

Journals

2021 JNMR: “Automatic melody harmonization with triad chords: A comparative study.”
AI Music - Arxiv | GitHub (2nd Author)

Workshops

2019 ISMIR Late-Breaking: “Jamming with yating: Interactive demonstration of a music composition AI.”
HCI - Paper | Demo (1st Author)
2019 ISMIR Late-Breaking: “Learning to generate jazz and pop piano music from audio via MIR techniques.”
HCI - Paper (2nd Author)

GitHub Organization

Yating Music, Taiwan AI Labs
2019 April - 2024 December
[GitHub]

From 2019, as one of the core members of a research team at Taiwan, we open-sourced AI technologies for music before the dawn of the generative AI era.

Toolkits

miditoolkit (254 stars) - GitHub
toolkit - A python toolkit for handling MIDI I/O in ticks, the native time unit of the MIDI protocol.
Lead Sheet Dataset (119 stars) - GitHub
dataset -A web crawler code for lead sheets (melody&chord) from Hooktheory.
ReaRender (98 stars) - GitHub
toolkit - A python toolkit for automatic audio/MIDI rendering using REAPER.

Exhibitions

花蓮流行音樂AI實驗基地
2025 January
[Website]

Technical staff responsible for deploying text-to-video and text-to-music models in the exhibition. Located in Hualien, a beautiful town surrounded by mountains and sea.

教我如何做你的愛人 - 陳珊妮AI模型
2023 March
[YouTube]

Technical staff responsible for singing voice synthesis, collaborating with a famous Taiwanese pop singer, Sandee Chen (陳珊妮).

Award

Silver Award, 9th Merry Electronics Master Thesis Award, (4,500 USD) - Link.

Materials

Research & Work Experience Overview
2024 December
[PDF]

This presentation provides an overview of my research and professional experience up to 2024, along with my technical stack and the key industry trends over time.

My Research Notes on Audio Effect Modeling
2022 December
[PDF] | [Paper]

These slides document my experience in digital signal processing, audio effect modeling, and the applications of neural network in these domains.