{"id":804,"date":"2019-11-02T19:22:01","date_gmt":"2019-11-02T22:22:01","guid":{"rendered":"https:\/\/www.blogs.unicamp.br\/zero\/?p=804"},"modified":"2022-05-23T17:22:58","modified_gmt":"2022-05-23T20:22:58","slug":"messy-translations","status":"publish","type":"post","link":"https:\/\/www.blogs.unicamp.br\/zero\/804\/","title":{"rendered":"Messy Translations"},"content":{"rendered":"\t\t<div data-elementor-type=\"wp-post\" data-elementor-id=\"804\" class=\"elementor elementor-804\" data-elementor-settings=\"{&quot;ha_cmc_init_switcher&quot;:&quot;no&quot;}\">\n\t\t\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-b9412ca elementor-section-boxed elementor-section-height-default elementor-section-height-default jltma-glass-effect-no\" data-id=\"b9412ca\" data-element_type=\"section\" data-e-type=\"section\" data-settings=\"{&quot;_ha_eqh_enable&quot;:false}\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-10e4057 jltma-glass-effect-no\" data-id=\"10e4057\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-13d4536 jltma-glass-effect-no elementor-widget elementor-widget-heading\" data-id=\"13d4536\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h4 class=\"elementor-heading-title elementor-size-default\"><a href=\"https:\/\/www.blogs.unicamp.br\/zero\/2019\/11\/02\/traducoes-baguncadas\/\">(Traduzir)<\/a><\/h4>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-2a0b316 jltma-glass-effect-no elementor-widget elementor-widget-text-editor\" data-id=\"2a0b316\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p align=\"justify\">In this text we will discuss the super power that automatic translations have to mess up the text and how this is the secret so that \u201clazy\u201d people (people who copy the work of their friends) can get away with being caught by similarity checkers.<\/p><p align=\"justify\">A similarity checker is software used to identify how similar a text is to others already published. This is a resource used to evaluate academic works and to judge if the similarities are natural or if it is the result of plagiarism. For almost a year I was responsible for verifying similarities and helping to correct course completion works, helping to prevent similarities from appearing or even being interpreted as plagiarism\u2026 this taught me something very interesting, that automatic translations can be used for clean up texts and thus avoid them being so similar.<\/p><p align=\"justify\">The reason that automatic translators mess up the text is not a software problem, but a language problem\u2026 to understand this, let&#8217;s analyze the set of words that make up a language as the Domain of a function and their respective meanings as the Image of that function . Thus, for each word &#8220;blablublau&#8221;, a meaning in that language is associated, for example &#8220;the effect of osteoporosis on Jamaican raccoons&#8221;, which can also have this exact meaning for another language, associated with the word &#8220;parapapapapa&#8221;.<\/p><p align=\"justify\">Thus, if in language X there are M words-meanings and in language Y there are N words-meanings. Of the words of language X, if there are L words with meanings equal to language Y, that is, if to translate from language X to language Y or the other way around, there are L words whose translations will be exact.<\/p><p align=\"justify\">The translation in this case will be accurate, since for each word in the Domain of one language, we generate an Image (meaning) which is also the Image of a word in the Domain of another language. Thus, if the word is the \u201ccode\u201d that the language uses to define the meaning, we can say that for both languages, these L words are equal to less than the \u201ccode\u201d used as a label, for example:<\/p><p align=\"center\"><i>portugu\u00eas \u2192 quadrado \u2192 \u25a1 \u2190 square \u2190 ingl\u00eas<\/i><\/p><p align=\"justify\">However, the mess of machine translations between X and Y is the result of M-L or N-L words. For if an Image of the Domain of the words of language X does not have an Image associated with language Y, then it is necessary to combine two or more words-meanings of language Y to be equivalent to the word-meaning of language X, for example:<\/p><p align=\"center\"><i>portugu\u00eas \u2192 anteontem || day before yesterday \u2190 ingl\u00eas<\/i><\/p><p align=\"justify\">Back in the sense of Domain and Image, we will call D (xi) the Domain of a word of the language X, I(x<sub>i<\/sub>) the Image of a word of the language X, I(y<sub>j<\/sub>) a Image of a word of the language Y, and D(y<sub>j<\/sub>) the Domain of a word of the language Y. Thus, for a first translation we will have:<\/p><p align=\"center\"><i>D(x<\/i><sub><i>0<\/i><\/sub><i>) = I(x<\/i><sub><i>0<\/i><\/sub><i>) = I(y<\/i><sub><i>1<\/i><\/sub><i>) + I(y<\/i><sub><i>2<\/i><\/sub><i>) + \u2026 + I(y<\/i><sub><i>N<\/i><\/sub><i>) = D(y<\/i><sub><i>1<\/i><\/sub><i>) + D(y<\/i><sub><i>2<\/i><\/sub><i>) + \u2026 + D(y<\/i><sub><i>N<\/i><\/sub><i>).<\/i><\/p><p align=\"center\"><i>(Pt) anteontem \u2192 (En) day before yesterday \u2192 (Pt) dia antes de ontem<\/i><\/p><p align=\"justify\">Thus, if any of the N meanings of the Image of language Y, is not among the L common words of the two languages, for example the Image of the word y<sub>k<\/sub>, we will have in the reverse process of translation from language Y to language X the following structure:<\/p><p align=\"center\"><i>D(y<\/i><sub><i>1<\/i><\/sub><i>) + D(y<\/i><sub><i>2<\/i><\/sub><i>) + \u2026 + D(y<\/i><sub><i>k<\/i><\/sub><i>) + \u2026 + D(y<\/i><sub><i>N<\/i><\/sub><i>) =<br \/><\/i><i>=<\/i><i> I(y<\/i><sub><i>1<\/i><\/sub><i>) + I(y<\/i><sub><i>2<\/i><\/sub><i>) + \u2026 I(y<\/i><sub><i>k<\/i><\/sub><i>) + \u2026 + I(y<\/i><sub><i>N<\/i><\/sub><i>) =<br \/><\/i><i>=<\/i><i> I(x<\/i><sub><i>1<\/i><\/sub><i>) + I(x<\/i><sub><i>2<\/i><\/sub><i>) + \u2026 I(x<\/i><sub><i>k1<\/i><\/sub><i>) + I(x<\/i><sub><i>k2<\/i><\/sub><i>) +\u2026 + I(x<\/i><sub><i>kN<\/i><\/sub><i>) + \u2026 + I(x<\/i><sub><i>N<\/i><\/sub><i>) = D(x<\/i><sub><i>1<\/i><\/sub><i>) + D(x<\/i><sub><i>2<\/i><\/sub><i>) + \u2026 D(x<\/i><sub><i>k1<\/i><\/sub><i>) + D(x<\/i><sub><i>k2<\/i><\/sub><i>) +\u2026 + D(x<\/i><sub><i>kN<\/i><\/sub><i>) + \u2026 + D(y<\/i><sub><i>N<\/i><\/sub><i>).<\/i><\/p><p align=\"justify\">In this way, for each word-meaning that does not present the same Image in the Domain of that language, we generate in translation at least two other words with the intention of explaining its meaning. But if one of these words that explains the meaning, does not present the same Image in the Domain of the other language, to reverse the translation, we will generate at least two other words that explain its meanings. The process goes on in that machine translation mess in which we blame the innocent software.<\/p><p align=\"justify\">Now let&#8217;s do an experiment that demonstrates how these messy translations can be used to avoid getting caught in similarity detectors even when copying the little friend&#8217;s text. We will do this with the text of our little friend William Shakespeare, take six verses from a well-known text, &#8220;to be or not to be&#8221;.<\/p><p align=\"left\"><em>To be, or not to be, that is the question:<\/em><br \/><em>Whether &#8216;tis nobler in the mind to suffer<\/em><br \/><em>The slings and arrows of outrageous fortune,<\/em><br \/><em>Or to take arms against a sea of troubles<\/em><br \/><em>And by opposing end them. To die\u2014to sleep,<\/em><br \/><em>No more; and by a sleep to say we end (William Shakespeare).<\/em><\/p><p align=\"justify\">Now let&#8217;s move it to Portuguese and then back to English.<\/p><p align=\"left\"><em>To be or not to be, that&#8217;s the question:<\/em><br \/><strong><u>If it is<\/u><\/strong><em> nobler in the mind to suffer<\/em><br \/><em>The slings and arrows of outrageous fortune,<\/em><br \/><em>Or to take <\/em><strong><u>up<\/u><\/strong><em> arms against a sea of troubles<\/em><br \/><em>And <\/em><strong><u>as opposed to finish<\/u><\/strong><em> them. To die &#8211; to sleep,<\/em><br \/><em>No <\/em><strong><u>longer<\/u><\/strong><em>; and <\/em><strong><u>for<\/u><\/strong><em> a sleep to say <\/em><strong><u>that we broke up<\/u><\/strong><em> (probably from William Shakespeare)<\/em><\/p><p align=\"left\">A relatively simple process, but the bold and underlined words have been changed from the initial text. Now a slightly bolder experiment. We will pass the original text from English \u2192 French \u2192 Greek \u2192 Russian \u2192 Japanese \u2192 Swahili \u2192 Hungarian \u2192 English, and with this we arrive at the following result that we marked in bold and underlined the parts that were changed:<\/p><p align=\"left\"><em>The question<\/em><strong><u> is, are there:<\/u><\/strong><br \/><strong><u>Be gentle with bitter ghosts<\/u><\/strong><br \/><strong><u>Fantastic site<\/u><\/strong><u> <\/u><em>and arrows<\/em><br \/><em>Or <\/em><strong><u>his hand<\/u><\/strong><em> against <\/em><strong><u>the<\/u><\/strong><em> sea <\/em><strong><u>in question<\/u><\/strong><br \/><strong><u>On the contrary, it is complete. <\/u><\/strong><em>Die<\/em><strong><u>-bed<\/u><\/strong><br \/><strong><u>She <\/u><\/strong><em>no<\/em><strong><u> longer sleeps.<\/u><\/strong><em> (unknown author)<br \/><\/em><\/p><p align=\"justify\">Reading the text, we realize that it is very different from the original, in fact it is another text. But in a sense it is related to that initial and the most incredible thing is that it was generated in a fully automatic way. In fact, this process distorts the original text enough to partially preserve its meaning while making the new text original.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-e8f6130 elementor-section-boxed elementor-section-height-default elementor-section-height-default jltma-glass-effect-no\" data-id=\"e8f6130\" data-element_type=\"section\" data-e-type=\"section\" data-settings=\"{&quot;_ha_eqh_enable&quot;:false}\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-1fd4303 jltma-glass-effect-no\" data-id=\"1fd4303\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-efadfbc jltma-glass-effect-no elementor-widget elementor-widget-heading\" data-id=\"efadfbc\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h4 class=\"elementor-heading-title elementor-size-default\"><a href=\"https:\/\/www.blogs.unicamp.br\/zero\/\">Back to main page<\/a><\/h4>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-571aec2 elementor-section-boxed elementor-section-height-default elementor-section-height-default jltma-glass-effect-no\" data-id=\"571aec2\" data-element_type=\"section\" data-e-type=\"section\" data-settings=\"{&quot;_ha_eqh_enable&quot;:false}\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-4bb7d67 jltma-glass-effect-no\" data-id=\"4bb7d67\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-07c7e7a jltma-glass-effect-no elementor-widget elementor-widget-heading\" data-id=\"07c7e7a\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h4 class=\"elementor-heading-title elementor-size-default\"><a href=\"https:\/\/www.blogs.unicamp.br\/zero\/who-writes-the-posts\/\">Who write the posts?<\/a><\/h4>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<\/div>\n\t\t","protected":false},"excerpt":{"rendered":"<p>Why do automatic translations mess up the text so much and how can we use it to our advantage?<\/p>\n","protected":false},"author":434,"featured_media":811,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"colormag_page_container_layout":"default_layout","colormag_page_sidebar_layout":"default_layout","_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"pgc_sgb_lightbox_settings":"","_vp_format_video_url":"","_vp_image_focal_point":[],"footnotes":""},"categories":[1209],"tags":[],"class_list":["post-804","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-v-2-ed-2"],"jetpack_featured_media_url":"https:\/\/www.blogs.unicamp.br\/zero\/wp-content\/uploads\/sites\/187\/2019\/11\/shakespeare.png","_links":{"self":[{"href":"https:\/\/www.blogs.unicamp.br\/zero\/wp-json\/wp\/v2\/posts\/804","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.blogs.unicamp.br\/zero\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.blogs.unicamp.br\/zero\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.blogs.unicamp.br\/zero\/wp-json\/wp\/v2\/users\/434"}],"replies":[{"embeddable":true,"href":"https:\/\/www.blogs.unicamp.br\/zero\/wp-json\/wp\/v2\/comments?post=804"}],"version-history":[{"count":9,"href":"https:\/\/www.blogs.unicamp.br\/zero\/wp-json\/wp\/v2\/posts\/804\/revisions"}],"predecessor-version":[{"id":1735,"href":"https:\/\/www.blogs.unicamp.br\/zero\/wp-json\/wp\/v2\/posts\/804\/revisions\/1735"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.blogs.unicamp.br\/zero\/wp-json\/wp\/v2\/media\/811"}],"wp:attachment":[{"href":"https:\/\/www.blogs.unicamp.br\/zero\/wp-json\/wp\/v2\/media?parent=804"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.blogs.unicamp.br\/zero\/wp-json\/wp\/v2\/categories?post=804"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.blogs.unicamp.br\/zero\/wp-json\/wp\/v2\/tags?post=804"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}