The Current #7

How AI is Reshaping Translation and Language Learning

by Hunter WorlandJul 17, 2024

The Current is a new series from NEA on the developments impacting consumer technology. Each installment examines a trend, disruption, or opportunity with consumer data. Posts are concise, informative, and always current.

In the late 19th century, an ophthalmologist from a polyglottic outpost of the Russian Empire invented Esperanto – a common second language that would eliminate linguistic barriers. Its community of esperantists dreamed their language could facilitate international commerce and forge a “a common brotherhood” between nations. The language gained millions of speakers but ultimately did little to remove the linguistic friction between markets. Can AI apps?

Linguistic friction is as old as civilization itself, echoing the ancient myth of Babel, where humanity's unified language shattered into multiple tongues (The Tower of Babel, Pieter Bruegel the Elder, 1563)

To examine this, we leveraged our consumer survey panel to investigate LLM translation efficacy, challenges among current language learners, and relative value between applications. Our research was deliberately outside the Anglosphere. English’s role as the global language makes it a less interesting market since the need for translation or language learning is inherently lower. As a case study, we used Germany – not only the largest market in Europe but also one with one of the highest incidence of English-language professional and educational requirements.

I. Language exam

We pitted the reigning champion of consumer application translation, Google Translate (which leverages its own form of AI, neural machine translation), against OpenAI's GPT-4o and Anthropic's Claude 3.5 Sonnet. Our judges were bilingual consumers fluent in both German and English.

The respondents selected the most natural translation across three passages: an idiomatic text, a technical text, and a conversational text.

The two LLMs essentially matched Google Translate across the passages with no statistically significant performance gap. Yet, a three-way tie for Google Translate is really a loss. If underlying translation technology is held equal, generative AI platforms certainly win on product experience as they leverage contextual understanding to provide more nuanced, coherent, and adaptable translations than Google Translate's primarily statistical neural machine translation. approach.

II. Pain Points

Our second survey targeted German learners of any foreign language. Surprisingly, the most significant pain point for language learners wasn't traditional pedagogical challenges like spelling or reading comprehension, but speech, including accent and pronunciation. Similarly, the largest unlock they envisioned was increased interaction with native speakers.

This presents a natural opportunity for generative AI disintermediation, both in product experience and underlying technology. The models can dynamically generate contextually appropriate language samples, simulating the diversity of real-world interactions. This capability allows for the coverage of a broader range of scenarios, including niche and informal use cases that often elude traditional pedagogical frameworks.

Pursuit of the opportunity will require:

  • Leveraging granular user data to create hyper-personalized learning pathways. For instance, optimizing instruction for a German B2 speaker requiring fluency in Canadian English accounting vocabulary for a short-term internship with frequent C-suite (i.e., formal language) presentations

  • Building and continually updating specialized language datasets for professional and regional dialects, say Canadian English accounting

  • Developing intelligent systems to dynamically route translation and learning tasks to the most proficient LLMs for specific language pairs and domains, addressing performance disparities only exacerbated by geofencing regulations

III. Applications

The application of underlying technology, by my account, falls into two (over-generalized) buckets: enhanced learning versus enhanced translation. It's a choice between augmenting human capability or outsourcing it entirely.

Our data, however, reveals a surprising twist. Contrary to the path of least resistance, consumers favor better learning over automation and more frictionless communication. An overwhelming majority of our respondents expressed a greater willingness to pay a subscription for a learning platform with 2x efficacy rather than a 2x translation platform.

Here are a few early-stage businesses at the frontier of AI language learning:

  • Our findings underscore the importance of hyper-personalized learning experiences. Promova addresses this need by offering a customizable AI-enhanced platform that goes beyond conventional adaptivity. By tailoring its approach to diverse learning styles and specific challenges like dyslexia, Promova creates an inclusive learning environment that resonates with our survey respondents' desire for personalized instruction

  • Speak harnesses AI to create an immersive tutoring experience, addressing crucial challenges like accent and pronunciation while fostering the native speaker interactions that our data shows learners value.

  • Our research emphasized the demand for speech. Univerbal rises to this challenge with its AI-powered tutor, facilitating unscripted dialogues across 21 languages and offering a flexible, accessible alternative to traditional learning methods.

  • Praktika's innovative use of AI avatars demonstrates the potential for technology to create more natural learning environments. By incorporating nuanced tones and emotional inflections, Praktika has developed a highly engaging approach to language acquisition, as evidenced by its rapidly growing user base of 1.2 million monthly active users.

  • For younger learners, personalized and interactive experiences are particularly crucial. Buddy AI caters to this demographic with its AI-powered English tutor, blending expert-crafted lesson plans with playful, voice-based interactions.

Reach out to hworland@nea.com continue the conversation.