What is Segmentation in Translation?


Segmentation in translation is the way to separate a source text into more modest units for translation. These units are arranged by picking specific segmentation decisions that fill in as a base for making and editing translation recollections, as per a picked language pair. These standards comprise a progression in translation robotization as frameworks’ figure out how’ to remember them, and they are naturally applied during the translation work process. 


With the ascent of computer-assisted translation tools, translators, commentators, and outsourcers have accessed software that significantly eliminates the time needed for translation-related errands through mechanization. Segmentation is one of the underlying advances that source content goes through before a translator begins the translation interaction. 


Today, we will be focusing on multiple aspects of segmentation in translation. For the ease of readers, we have divided the article into the following sections:


  • The language translation process
  • Segments and customer segmentation
  • The three main approaches to market segmentation
  • Segmentation approaches
  • Working on the segmentation process

The language translation process

Following are the steps involved in the language translation process:


Select the text

The initial step is to figure out the text you will translate. That is, the topic and substance, how long it is, the composing style, if it’s specialized, the different areas, and so forth. The translator will normally read or skim-read portions of the text to get an outline of the substance. They may note key ideas or phrasing they’ll have to explore and will choose if any starter foundation reading is required. Then, they’ll research and resolve how they’ll translate key terms prior to starting the translation.


Initial translation

Presently we deliberately translate the report, ordinarily in lumps of 5 – 10 words all at once. Picking the proper length of individual text pieces to deal with is significant. Ideally, each lump will be a discrete and complete unit of significance. Each piece likewise must be adequately short to hold in momentary memory. Anything over around ten words can be a battle. Sentences are oftentimes more than this, so they will commonly be separated into more limited units. Working with pieces that are too short or not discrete significance units will, in general, deliver an unnatural and conceivably indistinct translation. Then again, working with pieces that are excessively long to effectively recall risks some significance being missed in the translation.

Review the accuracy

After the primary draft is finished, the translator will, at that point, deliberately work through the translation, contrasting each piece of text and the source text. The essential objective here is to affirm they haven’t missed any substance or confounded any significance. Most translators will likewise identify and improve any marginally unnatural or inelegant phrasing in this progression.

Break is must

The following stage is extremely straightforward – set the translation aside and enjoy a reprieve. Ideally, this ought to be for a couple of hours or overnight. The thought is absolutely to clear the brain to guarantee a more compelling fifth and last advance.


Translation wording refining

In the last advance, the translator re-reads the translation, this time without reference to the source report, taking a gander at the nature of articulation. They’ll make last alters to additionally refine and “clean” the translated text.



With segmentation, the source content is separated into translation units called segments. These segments can be phrases, sections, list items, portrayals, titles, and so on. The segments are naturally created by specific segmentation rules. They help fabricate and modify the translation memory for a specific task or customer. Segmentation is a foundation arrangement for the future utilizing already existing translated content. Because of the created segments, the translator and commentator additionally see their assignment simplified. Remaining focused and recollecting the specific phrasing of a translated segment in a high-volume undertaking can be testing. Segmentation, and storage in a translation memory, help dispose of these translation issues by saving recently translated segments. 


With segmentation and the translation memory coming about because of already translated segments, the CAT tool will embed the current translation into context with precisely the same or halfway substance consequently supplanted. This is done through a coordinating interaction that creates matches inside certain match outlines.

Segmentation is an extraordinary resource inside a CAT tool that permits the translator to be more productive in translation. It is the base for the creation and editing of the translation memory for a language pair, customer, or venture. The center advantage is time saved looking for already translated sentences and other text units that are rehashed inside the text. Segmentation is a computerized cycle once the recently characterized rules are set up in the framework. 


Segmentation likewise separates specific units, recently translated, for future reuse, reducing expenses for customers by utilizing already translated content. Also, time reserve funds mean quicker venture turnarounds while improving translation consistency and quality. This aids the client fabricates an undertaking specific or general translation memory that can be utilized for any future translation project, disposing of excess translation.


The three main approaches to market segmentation

Following are the three main approaches to market segmentation:


A piori segmentation

It is the least complex methodology, utilizes a classification plot based on openly accessible attributes, for example, industry, and friends size, to make particular gatherings of customers inside a market. However, from the earlier, market segmentation may not generally be legitimate since organizations in a similar industry and of a similar size may have totally different needs.



Needs-based segmentation is based on differentiated, approved drivers (needs) that customers express for a specific item or service being advertised. The needs are found and verified through essential market exploration, and segments are separated based on those different needs as opposed to attributes, for example, industry or organization size.



Value-based segmentation differentiates customers by their monetary value, gathering customers with a similar value level into singular segments that can be unmistakably focused on.

Segmentation approaches

Following are the segmentation approaches in market segmentation:


The first is to keep planning of language area code to each segment. If you consider the big picture, each email must have a solitary language; well, there are cases for bilingual substance. However, that is a point for another blog entry. In light of this, we can plan each segment to a language. Presently this methodology may be somewhat excess on the grounds that the majority of your segments may be the equivalent. To construct your planning in a way that keeps it easy to refresh and allude (i.e., just incorporate non-English mappings, for instance). Likewise, remember an assignment for the segment name so you realize that you ought to allude to the planning. 

Another is to separate the substance from the email segmentation to permit language to be autonomously related. In this arrangement, we don’t have to make language segments except if we need to. Rather language can be dictated by the supporter’s data, and our email layouts get the related language content. I like this arrangement since now we can utilize language region codes and not need to stress over affecting our segmentation plan. However, for this answer for work, our email formats need to become *smart* since they need to deal with getting the correct language.


Working on the segmentation process

The CAT tool divides the text into sentences or sections as per the sort of file that you wish to measure. By and large, it is ideal for partitioning the text into sentences. The CAT tool identifies the sentences in the source text as indicated by a progression of accentuation marks: full stops, line breaks, outcry, and question marks, colons, and semi-colons. On account of tables and records, it is ideal for partitioning the text into passages. This cycle is critical as the CAT tool will look for these segments in the translation memory to decide if they have already been translated or not.


Following are the points that must be focused on when doing the segmentation process:


  • Try not to utilize section breaks to partition portions of the text in the archive. The CAT tool identifies the section break and, in this way, segments the text. If a section break is utilized to partition a title into two lines, for instance, the CAT tool will identify the title as two autonomous sentences. For the situation that this title has already been translated in the memory, the CAT tool would not identify it on the grounds that the source segment would not match the past translation. The expense of translation of that title would along these lines be higher. If you need to partition the title, do as such with a constrained line break as you would then be able to design the CAT tool not to isolate the sentence.
  • Use exclamation and question marks effectively. 
  • Guarantee that all sentences are composed with a full stop. 
  • Keep away from the utilization of shortenings. 
  • Try not to enter numbered records physically. Utilize the programmed highlight of the word processor. 
  • Guarantee that the sections have been applied effectively. 
  • Avoid the utilization of spaces.


As should be obvious, little subtleties can receive significant rewards. If you might want us to educate you on the composing regarding your specialized documentation or if you need to translate a record, kindly don’t spare a moment to get in touch with us. We would gladly assist you with lessening your translation costs.



This article first appeared on Harry Clark blog

Also, read Usability Strategies for Translating Technical Documentation