Generative AI
Explore discussions on generative artificial intelligence techniques and applications within the Databricks Community. Share ideas, challenges, and breakthroughs in this cutting-edge field.

Best Practices for Multilingual Model Training: Single vs. Multi-Model for Translation

Maylin
New Contributor

Hello everyone,

I'm working on a translation project involving documents up to 100 pages long, in 17 different languages, and I'm looking for the best approach to achieve high-quality translations in this multilingual context.

  1. Single model vs. multi-model approach

    • Is it better to use a single multilingual model or to train separate models for each source language?
    • If I go with a single model, is it possible to progressively add each new language by retraining the model multiple times without losing the ability to translate into previously trained languages?
    • Lastly, if I'm using the same source language, can I train the model to translate into multiple target languages without needing a dedicated model for each source-target combination?
  2. Model
    I'm planning to use Databricks to train the model, following the advice from this article: Fine-Tuning Large Language Models, and leveraging Hugging Face's translation script: run_translation.py. Would this approach be effective for achieving quality translations across a wide range of languages?

  3. Using Databricks functions for common languages
    Databricks offers a built-in translation function (ai_translate), but it currently only supports translations between French, English, and Spanish. If one of these languages matches my translation requirements, would it make sense to prioritize this solution? Is it potentially more effective than tools like DeepL, which haven't fully met my client's expectations?
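
To make the "one source language, several target languages" question in point 1 more concrete, here is a minimal sketch of the data layout I have in mind. It is only an illustration under assumptions: the helper `make_tagged_examples`, the `<2xx>` target-language tag, and the `src`/`tgt` keys are names I made up for this post, and the `{"translation": {...}}` record shape is, as far as I understand it, the JSON-lines layout that run_translation.py reads. I have not verified that training on data shaped like this gives good results.

```python
import json


def make_tagged_examples(source_text, translations, tag_format="<2{lang}>"):
    """Build one training record per target language from one source sentence.

    The target language is signalled by a tag prepended to the source text,
    so a single model could (in principle) learn several target languages.
    The tag format and the "src"/"tgt" keys are hypothetical placeholders.
    """
    examples = []
    # Sort by language code so the output order is deterministic.
    for lang, target_text in sorted(translations.items()):
        tag = tag_format.format(lang=lang)
        examples.append(
            {"translation": {"src": f"{tag} {source_text}", "tgt": target_text}}
        )
    return examples


examples = make_tagged_examples(
    "Hello, world.",
    {"fr": "Bonjour, le monde.", "es": "Hola, mundo."},
)

# One JSON object per line, ready to be written to a .jsonl training file.
for example in examples:
    print(json.dumps(example, ensure_ascii=False))
```

If this layout makes sense, every source sentence would appear once per target language, and the only thing telling the model which language to produce would be the tag at the start of the source text.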

Thanks in advance for any advice and insights on the best approach to take!

0 REPLIES 0
