• ninpnin@sopuli.xyz
    link
    fedilink
    English
    arrow-up
    1
    ·
    7 months ago

    Any individual action can be combatted easily. A million different signatures and headers is a whole different .

    Mind you, LLM training data is polluted with anything and everything, including other languages. Recently, the best performance has been reached using higher quality data.