Huge models are trained on data in multiple languages, so I would like to use such model to detect the language of the input. I would extract paragraphs from a webpage, then have the AI analyze the text and spit out something like "the majority of the text is in English, small parts are in German and Swedish".
Is it a feasible application for an LLM? Or will a simple frequency analysis for language detection be more accurate and efficient?