
gittech. site
for different kinds of informations and explorations.
i18n-ai-translate: automate i18n JSONs with ChatGPT/Gemini/Ollama
i18n-ai-translate
Leverage ChatGPT, Gemini, Ollama, or Claude for seamless translation of localization files. Supports directories of nested translation files. Requires i18next-style JSON files.
Three prompts are chained to ensure each translation is well-formed.
- The translation prompt attempts a translation
- The translation verification prompt uses a separate context to verify the translation
- The styling verification prompt uses yet another context to verify the translation's formatting is consistent with the source
History is retained between calls to ensure consistency when translating the entire file.
https://github.com/user-attachments/assets/4909bf01-3e7a-464a-9c6e-2d1b82cc47d0
- Usage
- Translation prompt
- Translation verification prompt
- Styling verification prompt
- Prompt overriding
Usage
Quick-start
GitHub Actions
Incorporate it into your CI with a GitHub Action to auto-translate keys for every pull request as a new commit. All configurable options available in action.yml.
The following translates every PR where i18n/en.json
has been modified:
name: i18n-ai-translate
on:
pull_request:
# Only trigger when en.json has been modified
paths:
- "i18n/en.json"
jobs:
build:
runs-on: ubuntu-latest
steps:
- name: i18n-ai-translate
uses: taahamahdi/i18n-ai-translate@master
with:
json-file-path: i18n/en.json
api-key: ${{ secrets.OPENAI_API_KEY }}
Running directly
git clone [email protected]:taahamahdi/i18n-ai-translate.git
cd i18n-ai-translate
yarn
cp /home/en.json jsons/
# Generate French translations
npm run i18n-ai-translate -- translate -i en.json -o fr.json --engine chatgpt --model gpt-4o --api-key <openai_key>
Running as a script in your own project
yarn add i18n-ai-translate
# Generate French translations
npx i18n-ai-translate translate -i en.json -o fr.json --engine gemini --model gemini-2.0-flash-exp --api-key <gemini_key>
# Or, assuming you already have other translations in the current directory
npx i18n-ai-translate diff --before en-before.json --after en.json --input-language English --engine claude --model claude-3-5-sonnet-latest --api-key <anthropic_key>
Running as a library
import { translate } from "i18n-ai-translate";
...
const englishJSON = {
"welcomeMessage": "Welcome, {{name}}!",
"messages": {
"notifications_one": "You have one notification",
"notifications_other": "You have {{count}} notifications",
"delete": "Would you like to delete the \"{{name}}\" category?"
}
};
const frenchTranslation = await translate({
inputJSON: englishJSON,
inputLanguage: "English",
outputLanguage: "French",
...
});
console.log(frenchTranslation);
{
"welcomeMessage": "Bienvenue, {{name}} !",
"messages": {
"notifications_one": "Vous avez une notification",
"notifications_other": "Vous avez {{count}} notifications",
"delete": "Voulez-vous supprimer la catégorie « {{name}} » ?"
}
}
Script
Use i18n-ai-translate translate
to convert a local i18n JSON file to any language. Relative paths begin from the jsons/
directory.
Use i18n-ai-translate diff
to find the differences between two versions of a source language file, and apply them to all language files in the same directory.
Create a .env
file with an entry for your API key, or pass the --api-key
flag.
GEMINI_API_KEY=<your Gemini API key>
OPENAI_API_KEY=<your OpenAI API key>
ANTHROPIC_API_KEY=<your Anthropic API key>
For Ollama, create an entry for your host, use the --host
flag to set a custom host and path (Defaults to localhost:11434
).
OLLAMA_HOSTNAME=<the server and port number running Ollama>
Usage: i18n-ai-translate [options] [command]
Use ChatGPT or Gemini to translate your i18n JSON to any language
Options:
-V, --version output the version number
-h, --help display help for command
Commands:
translate [options]
diff [options]
help [command] display help for command
Usage: i18n-ai-translate translate [options]
Options:
-i, --input <input> Source i18n file or path of source language, in the jsons/ directory if a relative path is given
-o, --output-languages [language codes...] A list of languages to translate to
-e, --engine <engine> Engine to use (chatgpt, gemini, ollama, or claude)
-m, --model <model> Model to use (e.g. gpt-4o, gemini-2.0-flash-exp, llama3.3, claude-3-5-sonnet-latest)
-r, --rate-limit-ms <rateLimitMs> How many milliseconds between requests (defaults to 1s for Gemini, 120ms (at 500RPM) for ChatGPT, 1200ms
for Claude)
-f, --force-language-name <language name> Force output language name
-A, --all-languages Translate to all supported languages
-p, --templated-string-prefix <prefix> Prefix for templated strings (default: "{{")
-s, --templated-string-suffix <suffix> Suffix for templated strings (default: "}}")
-k, --api-key <API key> API key
-h, --host <hostIP:port> The host and port number serving Ollama. 11434 is the default port number.
--ensure-changed-translation Each generated translation key must differ from the input (for keys longer than 4) (default: false)
-n, --batch-size <batchSize> How many keys to process at a time (default: "32")
--skip-translation-verification Skip validating the resulting translation through another query (default: false)
--skip-styling-verification Skip validating the resulting translation's formatting through another query (default: false)
--override-prompt <path to JSON file> Use the prompts from the given JSON file, containing keys for at least one of generationPrompt,
translationVerificationPrompt, stylingVerificationPrompt
--verbose Print logs about progress (default: false)
--help display help for command
Usage: i18n-ai-translate diff [options]
Options:
-b, --before <fileOrDirectoryBefore> Source i18n file or directory before changes, in the jsons/ directory if a relative path is given
-a, --after <fileOrDirectoryAfter> Source i18n file or directory after changes, in the jsons/ directory if a relative path is given
-l, --input-language <inputLanguageCode> The input language's code, in ISO6391 (e.g. en, fr)
-e, --engine <engine> Engine to use (chatgpt, gemini, ollama, or claude)
-m, --model <model> Model to use (e.g. gpt-4o, gemini-2.0-flash-exp, llama3.3, claude-3-5-sonnet-latest)
-r, --rate-limit-ms <rateLimitMs> How many milliseconds between requests (defaults to 1s for Gemini, 120ms (at 500RPM) for ChatGPT, 1200ms for
Claude)
-k, --api-key <API key> API key
-h, --host <hostIP:port> The host and port number serving Ollama. 11434 is the default port number.
--ensure-changed-translation Each generated translation key must differ from the input (for keys longer than 4) (default: false)
-p, --templated-string-prefix <prefix> Prefix for templated strings (default: "{{")
-s, --templated-string-suffix <suffix> Suffix for templated strings (default: "}}")
-n, --batch-size <batchSize> How many keys to process at a time (default: "32")
--skip-translation-verification Skip validating the resulting translation through another query (default: false)
--skip-styling-verification Skip validating the resulting translation's formatting through another query (default: false)
--override-prompt <path to JSON file> Use the prompts from the given JSON file, containing keys for at least one of generationPrompt,
translationVerificationPrompt, stylingVerificationPrompt
--verbose Print logs about progress (default: false)
--help display help for command
Example usage
npx i18n-ai-translate translate -i en.json -o fr
- Translate the
en.json
file injsons/
to French, and save the output infr.json
npx i18n-ai-translate translate -i en.json -o es de nl --engine gemini
- Translate the
en.json
file injsons/
to Spanish, German, and Dutch, and save each file injsons/
, using Google Gemini
npx i18n-ai-translate diff -b en.json -a en-after.json -l English --verbose --engine ollama --host my-olllama-server.com:12345
- Translate the keys that have changed between
en.json
anden-after.json
for all files in theen.json
directory, with logging enabled using Ollama running onmy-ollama-server.com:12345
npx i18n-ai-translate translate -i en.json -A --engine chatgpt --model gpt-4-turbo --api-key <my_key> --rate-limit-ms 150 -n 64
- Translate the
en.json
file injsons/
to 200+ languages, save each file injsons/
, using the GPT-4 Turbo model of ChatGPT, with the given key, a rate limit of 150ms between requests, and 64 keys sent in each batch
npx i18n-ai-translate diff -b en -a en-after --engine claude
- Translate the keys that have changed between
en/
anden-after/
for all JSON files in both directories using Claude
As a library
Alternatively, import this project and use it to convert JSONs on-the-fly with translate()
, or use translateDiff()
to fetch updates to modified keys when your source i18n file has changed.
import { translate, translateDiff } from "i18n-ai-translate";
...
const translation = await translate({
engine, // ChatGPT, Gemini, Ollama, or Claude
model, // Model to use (e.g. gpt-4o, gemini-2.0-flash-exp, llama3.3, claude-3-5-sonnet-latest)
chatParams, // Additional configuration to pass to the model
rateLimitMs, // How many milliseconds between requests
apiKey, // OpenAI/Gemini/Anthropic API key
host, // The host and port number running Ollama
inputJSON, // JSON to translate
inputLanguage, // Language of inputJSON
outputLanguage, // Targeted language (e.g. French, Spanish, etc.)
templatedStringPrefix, // The start of inline variables; defaults to "{{"
templatedStringSuffix, // The end of inline variables; defaults to "}}"
verbose, // Print status of conversion to stdout/stderr
ensureChangedTranslation, // Every key longer than 4 characters must be different than the input
batchSize, // How many keys to process at a time
skipTranslationVerification, // Whether to skip an additional query to see whether the resultant translation makes sense
skipStylingVerification, // Whether to skip an additional query to see whether the text formatting remained consistent
overridePrompt, // Set custom prompts for generation or verification
});
const translations = await translateDiff({
engine, // ChatGPT, Gemini, Ollama, or Claude
model, // Model to use (e.g. gpt-4o, gemini-2.0-flash-exp, llama3.3, claude-3-5-sonnet-latest)
chatParams, // Additional configuration to pass to the model
rateLimitMs, // How many milliseconds between requests
apiKey, // OpenAI/Gemini/Anthropic API key
host, // The host and port number running Ollama
inputLanguage, // Language of inputJSONBefore/After
inputJSONBefore, // The source translation before a change
inputJSONAfter, // The source translation after a change
inputLanguage, // Language of inputJSONBefore/After
toUpdateJSONs, // An object of language codes to their translations
templatedStringPrefix, // The start of inline variables; defaults to "{{"
templatedStringSuffix, // The end of inline variables; defaults to "}}"
ensureChangedTranslation, // Every key longer than 4 characters must be different than the input
verbose, // Print status of conversion to stdout/stderr
batchSize, // How many keys to process at a time
skipTranslationVerification, // Whether to skip an additional query to see whether the resultant translation makes sense
skipStylingVerification, // Whether to skip an additional query to see whether the text formatting remained consistent
overridePrompt, // Set custom prompts for generation or verification
});
Translation prompt
Batches of the i18n input are passed in. Each call is checked to ensure no keys are lost, all templated strings are retained, and no translations were skipped.
You are a professional translator.
Translate each line from ${inputLanguage} to ${outputLanguage}.
Return translations in the same text formatting.
Maintain case sensitivity and whitespacing.
Output only the translations.
All lines should start and end with an ASCII quotation mark (").
${input}
Translation verification prompt
The output of the translation is sent back to ensure the model is okay with the translation. If this fails, the translation is re-generated.
Given a translation from ${inputLanguage} to ${outputLanguage} in CSV form, reply with NAK if _any_ of the translations are poorly translated.
Otherwise, reply with ACK.
Only reply with ACK/NAK.
Styling verification prompt
Formatting from the input should be retained where possible. If punctuation, capitalization, or whitespaces differ between the source and the translation, the translation is re-generated.
Given text from ${inputLanguage} to ${outputLanguage} in CSV form, reply with NAK if _any_ of the translations do not match the formatting of the original.
Check for differing capitalization, punctuation, or whitespaces.
Otherwise, reply with ACK.
Only reply with ACK/NAK.
Prompt overriding
Replace the aforementioned prompts with your own by creating a JSON file containing keys of at least one of generationPrompt
, translationVerificationPrompt
, or stylingVerificationPrompt
. Then, pass it as an argument with --override-prompt <path to file>
. Be sure to include templated arguments like ${inputLanguage}
as part of the prompt.