Google is testing a new API that uses machine learning models to offer real-time language translation for inputted text and to make it easier to translate web pages.
According to a proposal spotted by Bleeping Computer, the feature is being developed by Chrome's built-in AI team and is aimed at exposing the web browser's built-in translation functionality and the ability to download additional language models to translate text.
While Chrome and Edge already have built-in translation features, they can sometimes have issues translating web pages that have dynamic or complex content. For example, Chrome may not be able to translate all sections of an interactive website correctly.
In such cases, websites have their own translation tool, which could consume a lot of resources and can be slow compared to the browser's built-in feature. Google plans to give developers access to Chrome's upcoming AI-based translation feature through a simple JavaScript API.
"To perform translation in such cases, web sites currently have to either call out to cloud APIs, or bring their own translation models and run them using technologies like WebAssembly and WebGPU," explains Google.
"This proposal introduces a new JavaScript API for exposing a browser's existing language translation abilities to web pages, so that if present, they can serve as a simpler and less resource-intensive alternative." explains Google.
The new AI/API feature would simplify this process by using Chrome's built-in translation engine for existing content or with inputted text, such as a forum or real-time chat service.
The feature will download a machine learning model to correctly translate the text if the required translation model is not built into the browser.
The proposal, as it stands, currently has some privacy issues, as it could potentially allow a website to fingerprint a user based on the browser's supported languages and availability status.
"The most obvious identifier in the current API design is the list of supported languages, and especially their availability status ("no", "readily", or "after-download")," warns Google.
"For example, as of the time of this writing Firefox supports 9 languages, which can each be independently downloaded. With a naive implementation, this gives 9 bits of identifying information, which various sites can all correlate."
The developers outlined potential mitigations, such as grouping language packs to reduce the number of bits or only exposing a fixed set of languages based on a user's locale.
The proposal also mentions allowing developers to determine if the translation is done on-device or using cloud services so they can make sure no sensitive text is sent to a third-party service, which could cause data leaks.
It's unclear when the feature would be added to Chrome, but Google is actively exploring and discussing the idea with the open-source community.
Comments
U_Swimf - 1 month ago
What's the difference between screen recording or event logging every little thing we do... VERSUS... translating every piece of contextual writing on a page in order to assertain what anyone is reading/looking for?
I think they should consider taking a step back.. Especially con personas qué no quiero uno Grande Hermano veo mi.
powerspork - 1 month ago
Honestly, this data is pale compared to the data already collected by them. If you open gmail in the web UI, they read your mail. Initially, they read your mail whether you did that or not, but that was struck down in court. Google maps, if on your phone, knows your location well enough to guess where you are going to go to next (stalker's wet dream). Thankfully Chrome can only get away with so much in a non-Google environment, but given most of what people do now is in a browser, that doesn't matter much.
If you have an Android phone, I would wager Google surveillance greatly surpasses "normal" NSA surveillance, especially in ways that normally require a warrant. Then again, they collude with the NSA anyways, so this is a non-point.
https://digitalcontentnext.org/wp-content/uploads/2018/08/DCN-Google-Data-Collection-Paper.pdf
That report is old. They may have new capabilities.
NoneRain - 1 month ago
Google trackers on websites already track everything you do, they really don't need this to track your behavior online. I think data collected from live-chats and such things would be so small and irrelevant compared to everything else, they wouldn't even bother, especially if the language models are running locally.... but who knows.
GT500 - 1 month ago
I use Vivaldi, and I turn off its automatic webpage translation feature. I don't like the idea of translation services knowing everything I'm trying to read.