DingTalk and Tongyi Lab Launch Industry-Specific Speech Recognition Large Model Fun-ASR
On August 22, DingTalk and the speech team at Tongyi Lab jointly released a new generation of large-scale speech recognition model, Fun-ASR. This model accurately recognizes professional terminology across ten industries including interior construction and animal husbandry, and supports customized training for enterprise-specific models. Through deep collaboration, Fun-ASR efficiently transcribes various audio signals with capabilities in multi-industry terminology understanding, multilingual accent recognition, and contextual semantic reasoning.
Currently integrated into features such as subtitles and interpretation in DingTalk Meetings, smart meeting summaries, and voice assistant functions, Fun-ASR aims to build a stable, efficient, and easily scalable speech recognition foundation particularly suitable for enterprise scenarios demanding high accuracy and contextual comprehension.
Core Technology Highlights: Three Key Capabilities Ensuring High-Precision Recognition
Trained on over one billion hours of audio data and co-developed using real-world scenario data from multiple industries—including internet, technology, interior construction, animal husbandry, and automotive—provided by DingTalk, Fun-ASR significantly enhances its ability to recognize industry-specific terms.
Benchmark tests show an 18% improvement in recognition accuracy in the insurance industry and a 15%-20% increase in sectors like construction and animal husbandry. The model also supports enterprise-defined hotwords, allowing import of more than 1,000 custom vocabulary entries to improve recognition of rare or niche terms.
Fun-ASR can optimize inference using internal enterprise information such as contact lists, schedules, and knowledge bases within DingTalk, effectively reducing hallucinations in large models (with proper authorization) and delivering more reliable transcription results.
Leveraging an efficient end-to-end architecture, the model further refines algorithms using actual voice data provided by enterprises, improving recognition accuracy for proprietary content such as brand names, project codes, product names, and personal names.
For example, after enterprise-specific training, the model precisely identifies complex phrases like "Belgian imported Pulse latex" and "proprietary Sonocore foaming process" for KUKA Home, laying a solid foundation for subsequent customer demand analysis.
Future Outlook: Deepening Industry Adaptation Continuously
Li Xiangang, head of the speech team at Tongyi Lab, said: "We look forward to collaborating with DingTalk to drive innovative applications of speech recognition technology in enterprise settings. We will continue expanding the data volume and model scale of Fun-ASR, enhancing the replicability of solutions, and delivering smarter, more efficient experiences for businesses."
Zhu Hong, CTO of DingTalk, noted: "Through just three months of close collaboration, we achieved model deployment and earned recognition from leading customers—an important breakthrough toward industry leadership, and a replicable blueprint for other enterprises seeking customized large models."
The potential of Fun-ASR continues to be explored. Both parties will focus on upgrading capabilities in dialect recognition, noise-resistant performance, multilingual support, and deeper enterprise customization, comprehensively improving the precision and practicality of speech transcription to empower more enterprises in their intelligent transformation journey.
We dedicated to serving clients with professional DingTalk solutions. If you'd like to learn more about DingTalk platform applications, feel free to contact our online customer service or email at
Using DingTalk: Before & After
Before
- × Team Chaos: Team members are all busy with their own tasks, standards are inconsistent, and the more communication there is, the more chaotic things become, leading to decreased motivation.
- × Info Silos: Important information is scattered across WhatsApp/group chats, emails, Excel spreadsheets, and numerous apps, often resulting in lost, missed, or misdirected messages.
- × Manual Workflow: Tasks are still handled manually: approvals, scheduling, repair requests, store visits, and reports are all slow, hindering frontline responsiveness.
- × Admin Burden: Clocking in, leave requests, overtime, and payroll are handled in different systems or calculated using spreadsheets, leading to time-consuming statistics and errors.
After
- ✓ Unified Platform: By using a unified platform to bring people and tasks together, communication flows smoothly, collaboration improves, and turnover rates are more easily reduced.
- ✓ Official Channel: Information has an "official channel": whoever is entitled to see it can see it, it can be tracked and reviewed, and there's no fear of messages being skipped.
- ✓ Digital Agility: Processes run online: approvals are faster, tasks are clearer, and store/on-site feedback is more timely, directly improving overall efficiency.
- ✓ Automated HR: Clocking in, leave requests, and overtime are automatically summarized, and attendance reports can be exported with one click for easy payroll calculation.
Operate smarter, spend less
Streamline ops, reduce costs, and keep HQ and frontline in sync—all in one platform.
9.5x
Operational efficiency
72%
Cost savings
35%
Faster team syncs
Want to a Free Trial? Please book our Demo meeting with our AI specilist as below link:
https://www.dingtalk-global.com/contact

English
اللغة العربية
Bahasa Indonesia
Bahasa Melayu
ภาษาไทย
Tiếng Việt 