{"id":85238,"date":"2024-06-04T15:13:47","date_gmt":"2024-06-04T13:13:47","guid":{"rendered":"https:\/\/phrase.com\/?p=85238"},"modified":"2024-11-06T17:44:30","modified_gmt":"2024-11-06T16:44:30","slug":"rethinking-tm-threshold","status":"publish","type":"post","link":"https:\/\/phrase.com\/blog\/posts\/rethinking-tm-threshold\/","title":{"rendered":"Time to Rethink How Much We Depend on That \u201c70% TM Threshold\u201d Rule"},"content":{"rendered":"\n<div id=\"acf\/text-block_5a7dacf45ea0430c5cc210645b466038\" class=\"pxblock pxblock--text alignfull spacing--default bg--white\">\n\n\t\n\t<div class=\"container\">\n\t\t<div class=\"wysiwyg animate-in\">\n\t\t\t<p><span style=\"font-weight: 400;\">In my <\/span><a href=\"https:\/\/phrase.com\/blog\/posts\/unveiling-optimization-secrets-through-analytics\/\"><span style=\"font-weight: 400;\">previous post<\/span><\/a><span style=\"font-weight: 400;\">, we looked into how optimization through analytics can boost our translation productivity. We highlighted two key findings: the need to tweak our suboptimal TM Threshold settings and an eye-opening insight that improving the quality of translations produced by the TM pre-translation by just 1% can cut down segment editing time by a whole 3 seconds. Today, we&#8217;ll pick up right where we left off and dive into the blind chart we teased at the end of our last post.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Today, with labels:<\/span><\/p>\n\t\t<\/div>\n\t<\/div>\n<\/div>\n\n\n\n<div id=\"acf\/text-block_cc2445bf7f475839ce375dc09a9c91be\" class=\"pxblock pxblock--text alignfull spacing--default bg--white\">\n\n\t\n\t<div class=\"container\">\n\t\t<div class=\"wysiwyg animate-in\">\n\t\t\t<figure id=\"attachment_85239\" aria-describedby=\"caption-attachment-85239\" style=\"width: 1024px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-85239 size-large\" src=\"https:\/\/phrase.com\/wp-content\/uploads\/2024\/06\/fuzzy-match-1024x633.png\" alt=\"\" width=\"1024\" height=\"633\" srcset=\"https:\/\/phrase.com\/wp-content\/uploads\/2024\/06\/fuzzy-match-1024x633.png 1024w, https:\/\/phrase.com\/wp-content\/uploads\/2024\/06\/fuzzy-match-300x186.png 300w, https:\/\/phrase.com\/wp-content\/uploads\/2024\/06\/fuzzy-match-768x475.png 768w, https:\/\/phrase.com\/wp-content\/uploads\/2024\/06\/fuzzy-match.png 1200w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><figcaption id=\"caption-attachment-85239\" class=\"wp-caption-text\">The blue chart illustrates the average editing time of a segment relative to its TM Fuzzy score. The red chart illustrates the average editing time of a segment relative to its QPS score.<\/figcaption><\/figure>\n\t\t<\/div>\n\t<\/div>\n<\/div>\n\n\n\n<div id=\"acf\/text-block_75e2370f402acdcb7342f3c9d8e0f1ce\" class=\"pxblock pxblock--text alignfull spacing--default bg--white\">\n\n\t\n\t<div class=\"container\">\n\t\t<div class=\"wysiwyg animate-in\">\n\t\t\t<p><span style=\"font-weight: 400;\">Before we dive into interpreting the chart, it&#8217;s crucial to understand the distinct nature of the two scores. Let\u2019s first break down what these scores really mean, how they interact, and why they are critical for our optimization strategies. By doing so, we can better appreciate the insights the chart offers and make informed decisions to enhance our translation workflows.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">TM fuzzy matching is the helpful elf in Santa&#8217;s workshop. It scours through the TM, spotting a source segment that&#8217;s kind of like what you&#8217;ve already translated, and goes, &#8220;Hey, this is the closest you&#8217;ve translated so far.&#8221; But unless that score hits a perfect 100 or a magical 101, it&#8217;s not the exact match, and it mostly likely needs a linguist&#8217;s touch to polish it up. So, while TM fuzzy matching is super helpful\u2014it&#8217;s always just Santa&#8217;s helper. For the world of automation, it means you can\u2019t automate with TMs only.\u00a0<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Machine Translation (MT) is like aiming to be the big guy in red himself, striving to produce the final translation on its own. It has \u201chistorical matches\u201d in its training set, where they&#8217;re essentially baked into the neural network through machine learning. This brainy network can think outside the box, allowing it to translate new texts it hasn\u2019t encountered before. And that\u2019s the big difference from TMs that can\u2019t.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">TM Fuzzy matching is essentially a simple <\/span><i><span style=\"font-weight: 400;\">compare<\/span><\/i><span style=\"font-weight: 400;\"> algorithm, which can\u2019t produce final translation. But because we know precisely how it functions, we can confidently rely on the Fuzzy scores it generates. TM&#8217;s ability to deterministically produce a score we trust stems from its straightforward, predictable n-gram algorithm.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">MT has the capability to produce a final translation, which is pretty impressive. However, the catch is that we don\u2019t quite know how it came up with its results. The neural networks behind MT operate in ways that can sometimes feel like a black box, making it a bit of a mystery.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">That&#8217;s exactly why so much effort and resources have been devoted to developing a reliable algorithm to score MT outputs. At Phrase, this has been a pivotal research area for many years. The algorithm has evolved significantly over time, most notably replacing the original MTQE, which focused on estimating the edit distance needed to correct MT output. It has since shifted to Phrase Quality Performance Score (QPS), which incorporates human post-editing data and structured evaluations aligned with the industry-standard MQM framework.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Essentially, the <\/span><a href=\"https:\/\/phrase.com\/blog\/posts\/phrase-quality-performance-score\/\"><span style=\"font-weight: 400;\">Phrase QPS<\/span><\/a><span style=\"font-weight: 400;\"> is designed to mimic what you&#8217;d expect if a linguist were scoring translations based on the MQM framework. This means the score reflects a comprehensive, human-like assessment of translation quality, evaluating it with the same rigor and criteria a professional linguist would use. That\u2019s the type of data we train it on.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">How do we know the scores from Phrase QPS are reliable? This is where the chart we mentioned earlier comes into play. It demonstrates a solid correlation between the QPS scores of a segment and its editing time. It shows that higher scores require less intervention from linguists. Essentially, the better the score, the less the need for costly and time-consuming revisions. This kind of correlation is exactly what we want to see\u2014it confirms that the scoring algorithm is doing its job effectively. And despite being a NKOTB, it has its place as a trusted partner in our automation toolkit.<\/span><\/p>\n\t\t<\/div>\n\t<\/div>\n<\/div>\n\n\n\n<div id=\"acf\/text-block_c4c3de4d6bf7d903fd85069a43777226\" class=\"pxblock pxblock--text alignfull spacing--default bg--white\">\n\n\t\n\t<div class=\"container\">\n\t\t<div class=\"wysiwyg animate-in\">\n\t\t\t<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-85246 size-large\" src=\"https:\/\/phrase.com\/wp-content\/uploads\/2024\/06\/QPS-1024x638.png\" alt=\"\" width=\"1024\" height=\"638\" srcset=\"https:\/\/phrase.com\/wp-content\/uploads\/2024\/06\/QPS-1024x638.png 1024w, https:\/\/phrase.com\/wp-content\/uploads\/2024\/06\/QPS-300x187.png 300w, https:\/\/phrase.com\/wp-content\/uploads\/2024\/06\/QPS-768x479.png 768w, https:\/\/phrase.com\/wp-content\/uploads\/2024\/06\/QPS-1536x957.png 1536w, https:\/\/phrase.com\/wp-content\/uploads\/2024\/06\/QPS.png 1600w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><\/p>\n\t\t<\/div>\n\t<\/div>\n<\/div>\n\n\n\n<div id=\"acf\/text-block_72a4bf7fe15c69f9a09ad7c3dfd06d3b\" class=\"pxblock pxblock--text alignfull spacing--default bg--white\">\n\n\t\n\t<div class=\"container\">\n\t\t<div class=\"wysiwyg animate-in\">\n\t\t\t<p><span style=\"font-weight: 400;\">Looking further at the data, we see something pretty interesting: TM Fuzzy scores actually only beat out MT when the matches are perfect 100s or 101s. This really throws a wrench into that whole &#8220;70% TM Threshold is industry standard&#8221; thing we&#8217;ve been sticking to. It looks like that rule might not be holding up unless the matches are nearly perfect, which definitely puts that standard in a new, less flattering light. Maybe it&#8217;s time to rethink how much we depend on that 70% rule.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">If you&#8217;ve already ditched the 70% TM Threshold rule, then you might be ahead of the curve. But in <\/span><a href=\"https:\/\/phrase.com\/blog\/posts\/unveiling-optimization-secrets-through-analytics\/\"><span style=\"font-weight: 400;\">my last post<\/span><\/a><span style=\"font-weight: 400;\">, I highlighted how 70% remains the most commonly used threshold. This established norm has a hefty price tag associated with it.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Another point to note is that regardless of whether the initial translation was done by TM or MT, a segment that was poorly pre-translated typically requires about 20 to 25 seconds to fix.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Lastly, it&#8217;s important to note the steepness of the red line, which indicates a decrease of about 1 second of editing time per segment for every 1-point reduction in the QPS Threshold in TMS pre-translation. This suggests that slightly lowering the quality threshold could lead to significant time savings.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">How much could your organization save? Get ready for your journey to uncover the hidden treasures of cost-saving secrets! Discover the optimal TM and QPS Threshold values for your projects. The answers to these pivotal questions are just around the corner.<\/span><\/p>\n\t\t<\/div>\n\t<\/div>\n<\/div>\n\n\n\n<div id=\"acf\/text-block_7a3a182f265b272485a010d07bc0e601\" class=\"pxblock pxblock--text alignfull spacing--default bg--white\">\n\n\t\n\t<div class=\"container\">\n\t\t<div class=\"wysiwyg animate-in\">\n\t\t\t<hr \/>\n<p>&nbsp;<\/p>\n\t\t<\/div>\n\t<\/div>\n<\/div>\n\n\n\n<div id=\"acf\/blog-cta-block_19b27e376b1415994ab8d7d14a1ec479\" class=\"pxblock pxblock--blog-cta alignfull bg--green image--orientation-square\">\n\t<div class=\"block-container\">\n\t\t\t\t<div class=\"content\">\n\t\t\t<h3 style=\"text-align: center;\">Join our expert-led webinar<\/h3>\n<p style=\"text-align: center;\"><span style=\"color: #1080fc;\"><strong>October 16, 2024 at 4:30 pm CEST | 10:30 am EDT<\/strong><\/span><\/p>\n<p style=\"text-align: center;\">Join us for a game-changing webinar where we challenge the industry\u2019s reliance on the 70% Translation Memory (TM) threshold for pre-translations. We\u2019ll reveal how leveraging machine translation for TM scores between 70-98% can boost translation productivity and speed up project timelines.<\/p>\n<p style=\"text-align: center;\"><a class=\"btn btn--primary\" href=\"https:\/\/phrase.com\/resources\/webinars\/breaking-the-70-barrier-tm-threshold-optimization\/\">Register now<\/a><\/p>\n\t\t<\/div>\n\t<\/div>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>Discover how Advanced Analytics in Phrase TMS transforms business efficiency. Explore powerful dashboards, custom insights, and significant upcoming enhancements to streamline processes and take your efficiency to the next level.<\/p>\n","protected":false},"author":72,"featured_media":43487,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"_stopmodifiedupdate":false,"_modified_date":"","_searchwp_excluded":"","footnotes":""},"categories":[39],"class_list":["post-85238","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-translation-management"],"acf":[],"_links":{"self":[{"href":"https:\/\/phrase.com\/wp-json\/wp\/v2\/posts\/85238"}],"collection":[{"href":"https:\/\/phrase.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/phrase.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/phrase.com\/wp-json\/wp\/v2\/users\/72"}],"replies":[{"embeddable":true,"href":"https:\/\/phrase.com\/wp-json\/wp\/v2\/comments?post=85238"}],"version-history":[{"count":10,"href":"https:\/\/phrase.com\/wp-json\/wp\/v2\/posts\/85238\/revisions"}],"predecessor-version":[{"id":92271,"href":"https:\/\/phrase.com\/wp-json\/wp\/v2\/posts\/85238\/revisions\/92271"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/phrase.com\/wp-json\/wp\/v2\/media\/43487"}],"wp:attachment":[{"href":"https:\/\/phrase.com\/wp-json\/wp\/v2\/media?parent=85238"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/phrase.com\/wp-json\/wp\/v2\/categories?post=85238"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}