Vertens

Automatic translation of i18n messages for your products with AI.

Large language models are particularly good at languages. Let's use them to quickly make our products fully localized for international audiences.

Features

Support multiple target languages as long as the LLM support
Idempotency:
- Only translated missing keys in target language, do nothing otherwise.
- Keys order is preserved as the input. This is important for version control.
Models: OpenAI GPT. Others coming soon.
Batch support to fit into LLM context size

Backlog

Support other file formats: properties, gettext

Installation

WIP

Basic usage

Before running vertens, you need have an OpenAI key configured as an environment variable OPENAI_API_KEY.

Given an input file in this format:

{
    "key1": "Message 1",
    "key2": "Message 2",
}

The following command translate input.json to fr, de languages. The result for each language is written into its own file in the output directory.

vertens --language fr --language de <path/to/input.json> <path/to/output_directory>

You can also view other parameters with help

vertens --help

Recipes

(React) i18next messages

i18next is a popular i18n solution, especially for ReactJS.

Vertens currently support only JSON format which has top level keys, like

{ "key1": "Message 1", "key2": "Message 2", }

but not deep nested keys, like:

{ "key1" : { "key11": "" } }

You can write a script to transform between these formats so that you can use with Vertens.

We can translate the top level keys format as:

vertens --language fr ./lang-en.json ./lang-fr.json

You can also specify a placeholder value to specify which messages to be translated if its key is already present in the target language file.

vertens --language fr --placeholder __STRING_NOT_TRANSLATED__ ./lang-en.json ./lang-fr.json

This is typically useful if you are using a tool like i18next-scanner which is very cool. It scans messages to translate or remove (if no longer used)

Translate multiple languages

A loop in Bash would serve this purpose

#!/usr/bin/env bash

for lang in fr de
do
    vertens --language $lang ./lang-en.json ./lang-$lang.json
done

Smoke runs

If you have a huge translation file, and you want to test the vertens without translating all of them, you can use the --sample-size parameter to pick only a small portion of the file to translate.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
vertens		vertens
.gitignore		.gitignore
README.md		README.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Vertens

Features

Backlog

Installation

Basic usage

Recipes

(React) i18next messages

Translate multiple languages

Smoke runs

About

Releases

Packages

Languages

manhhavu/vertens

Folders and files

Latest commit

History

Repository files navigation

Vertens

Features

Backlog

Installation

Basic usage

Recipes

(React) i18next messages

Translate multiple languages

Smoke runs

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages