About
I founded Jina AI in 2020 and have been leading it as CEO ever since. Previously, I led neural search at Tencent between 2018-2020 and worked on search and recommendation systems at Zalando Research between 2014-2018, where I created Fashion-MNIST (11,000+ citations). I got my Ph.D. from TU Munich in 2014, focusing on adversarial and robust non-parametric Bayesian learning.
I’ve lived and worked in many places, including the San Francisco Bay Area, Berlin, Munich, Taipei, Beijing, and Shenzhen. I’m currently based in Mountain View.
Selected Publications
- Efficient Code Embeddings from Code Generation Models, 2025.8 - arXiv
- jina-embeddings-v4: Universal Embeddings for Multimodal Multilingual Retrieval, 2025.6 - arXiv
- ReaderLM-v2: Small Language Model for HTML to Markdown and JSON, 2025.3 - ICLR 2025 SCI-FM Workshop
- AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark, 2024.12 - ACL 2025
- jina-clip-v2: Multilingual Multimodal Embeddings for Text and Images, 2024.12 - ICLR 2025 SCI-FM Workshop
- jina-embeddings-v3: Multilingual Embeddings With Task LoRA, 2024.9 - ECIR 2025 Industry Track
- Late Chunking: Contextual Chunk Embeddings Using Long-Context Embedding Models, 2024.9 - SIGIR 2025 RobustIR Workshop
- Jina-ColBERT-v2: A General-Purpose Multilingual Late Interaction Retriever, 2024.8 - EMNLP 2024 Multilingual Representation Learning Workshop
- Jina CLIP: Your CLIP Model Is Also Your Text Retriever, 2024.5 - ICML 2024 MFM-EAI Workshop
- Multi-Task Contrastive Learning for 8192-Token Bilingual Text Embeddings, 2024.2 - arXiv
- Jina Embeddings 2: 8192-Token General-Purpose Text Embeddings for Long Documents, 2023.10 - arXiv
- Jina Embeddings: A Novel Set of High-Performance Sentence Embedding Models, 2023.7 - EMNLP 2023 NLP-OSS Workshop
- Dual ask-answer network for machine reading comprehension, 2018.9 - arXiv
- Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms, 2017 (> 11K Citations) - arXiv
- Support vector machines under adversarial label contamination - Neurocomputing
- Efficient Online Sequence Prediction with Side Information, 2013 - IEEE ICDM 2013
- Lazy Gaussian Process Committee for Real-Time Online Regression, 2013 - AAAI 2013
- Learning from Multiple Observers with Unknown Expertise, 2013 - PAKDD 2013
- Adversarial Label Flips Attack on Support Vector Machines, 2012 - ECAI 2012
- Evasion Attack on Multi-Class Linear Classifier, 2012 - PAKDD 2012
- Supervised Topic Transition Model for Detecting Malicious System Call Sequences, 2011 - SIGKDD Workshop 2011 (Best paper award)
- Toward Artificial Synesthesia: Linking Pictures and Sounds via Words, 2010 - NIPS Workshop 2010
- Efficient Collapsed Gibbs Sampling For Latent Dirichlet Allocation, 2010 - ACML 2010
Selected Media Coverages
- The Wall Street Journal, 2025.5 - The Tech Industry Is Huge—and Europe’s Share of It Is Very Small
- Deutsche Welle, 2024.10 - Making it in Germany: AI expert Han Xiao
- WIRED, 2024.10 - The Hottest Startups in Berlin in 2024
- MIT Technology Review, 2024.5 - Multimodal: AI’s new frontier
- arte, 2024.2 - Documentary: Smart New World - The AI Race
- German Embassy in China, 2023.11 - Innovation Dialogue
- TechCrunch, 2021.11
- Forbes, 2021.4 - AI DACH 30
- Handelsblatt, 2021.3 - Der Hype um KI-Start-ups ist vorbei – jetzt kommt es auf Qualität an
- Interview by Floydhub, 2020.6, The Future of AI is Open
- Guest post at LFAI, 2020.5, All-In Open Source: Why I Quit Tech Giant and Found My OSS Startup
- Interview: 200 Days as a Board Member in LF AI Foundation, QBit/量子位
- GNES opensource and Tencent OSPO presentation, OSS Summit EU, Lyon France
- Keynote at Darwin’s Circle, Vienna, Austria
- Panel Discussion (moderated by Christopher Keller from the Telegraph) at Darwin’s Circle, Vienna, Austria
- Interviewed by the Nature Magazine
- Interviewed by Alex Williams from the New Stack
- Keynote at Applied Artificial Intelligence Conference 2019, Vienna, Austria, invited by the Economic Chamber of Austria
- KubeCon + CloudNativeCon China 2019: AI/ML Media & Analyst Roundtable, Shanghai, China
- Opening talk & keynote at the annual meeting of German-Chinese Association of AI 2019, Berlin, Germany
- Interviewed by the derbrutkasten (an Austrian tech-focused media)
- Interviewed by OÖNachrichten (an Austrian local newspaper)
- Interviewed by Die Chinesische Handelskammer in Deutschland (德国中国商会)
- Internal talks at Amazon, SAP, HERE, OLX, Axel-Springer, Zalando, Carmeq, Porsche Digital Lab, appliedAI, etc. mostly in Germany Berlin and Munich area, 2018-2019