Clear all filters
End user applications
Infrastructure and cloud
machine learning (50)
web development (27)
A free/open-source machine translation platform
natural language processing
human language technologies
Adopting the Hindi-Bengali language pair (unreleased language pair).
In this project, I aim to create a hin-ben repository in Apertium that also includes the task of creating/expanding the transfer rules, creating the...
Apertium Browser Plugin
My project has been to develop the Apertium Browser Plugin. The previous Geriaoueg plugin is out of date, with the official link given in the wiki...
Adopt an unreleased language pair, Hindi-Bhojpuri
I plan on developing the Bhojpur-Hindi language pair in both directions i.e. bho-hin and hin-bho. This will involve building a monolingual...
Implementing new language pair: Kazakh - Uzbek
Having seen the benefits of the open-source Rule-Based Machine Translation platform - Apertium as an alternative to other free/commercial online...
Ideas for Google Summer of Code/Morphological analyser
• Creating a high-accuracy morphological analyser for Ibo by contributing to the currently existing one; • Increasing WER on the eng-ibo pair...
User friendly lexical training
The procedure for lexical selection training is a bit messy, with various scripts involved that require lots of manual tweaking, and many third party...
3 mostly unrelated smaller projects that all happen to start with "uni": UNIcode, UNIt testing, and UNIversal dependencies transfer (the latter being...
A morphological analyzer for Bagvalal
Bagvalal is an endangered typologically rare Caucasian language from the Nakh-Daghestanian family. Its conservation and study are constrained by the...
Finnish, Olonets-Karelian and Karelian lexicon development
The three languages that this application targets are closely related Balto-Finnic languages spoken in geographical proximity to one another. Finnish...
Develop a prototype MT system for a strategic language pair uzb->kaa
In this project I'm going to continue developing translation pair Uzb-Kaa languages. In the list of different pairs of Turkic languages, I analyzed...