Keyword Highlighting
The GDPR distinguishes between regular personal data and special categories of personal data (also known as sensitive personal data). Regular personal data consists of, for instance, name, address, telephone number, social security number, and any other data that could directly or indirectly identify a person.
Sensitive personal data, on the other hand, is divided into nine categories, listed below:
The GDPR prohibits the processing of such personal data unless one of the exceptions listed under the GDPR is met:
Keyword Highlighting can be used to highlight certain terms, phrases or patterns in the documents which might be important.
ZyLAB’s Keyword Highlighting feature makes it easy to identify keywords or terms that may be significant to your matter. When a term defined in a keyword highlighting rule appears in a document, it is highlighted, which alerts the reviewer to the term. Unlike search term hits, which are only highlighted when you search for them, a keyword highlighting rule is applied automatically to the entire dataset once it is created: the terms associated with the rule are highlighted in every document you view.
Keyword Highlighting can therefore be used to, for instance, highlight any sensitive or regular personal data present in the documents. If you search for documents belonging to the requester, these terms are automatically highlighted in each document, which gives a better overview. It is also possible, and advised, to use Insights to extract such sensitive or regular personal data.
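To make the idea of an automatically applied rule concrete, the following is a minimal Python sketch. It is not ZyLAB's implementation: the `HighlightRule` class, the `apply_rules` function, and the `[color]...[/color]` markers are hypothetical names chosen for illustration, and the rule terms are treated as plain words or phrases rather than full queries.

```python
import re
from dataclasses import dataclass


@dataclass
class HighlightRule:
    """Hypothetical stand-in for a keyword highlighting rule."""
    name: str
    color: str
    terms: list


def apply_rules(text, rules):
    """Wrap every rule term found in `text` in a [color]...[/color] marker."""
    # Map each term (lower-cased) to the color of the rule that defines it.
    color_of = {t.lower(): r.color for r in rules for t in r.terms}
    if not color_of:
        return text
    # Longest terms first, so a phrase wins over a shorter term inside it.
    ordered = sorted(color_of, key=len, reverse=True)
    pattern = re.compile(
        r"\b(?:" + "|".join(re.escape(t) for t in ordered) + r")\b",
        re.IGNORECASE,
    )

    def mark(match):
        color = color_of[match.group(0).lower()]
        return f"[{color}]{match.group(0)}[/{color}]"

    return pattern.sub(mark, text)


# The same rule is applied to every document, with no search required.
health = HighlightRule("Health", "green", ["hospital", "sick leave"])
for document in ["She took sick leave.", "The Hospital called."]:
    print(apply_rules(document, [health]))
```

The point of the sketch is only that the rule is defined once and then applied to each document as it is viewed, in contrast to search hits, which depend on the current search.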
Define a keyword rule that will find and highlight terms in a specific color (not the default yellow).
- On the Home page, select Configuration:
- Select Keyword Highlighting:
- You will see a list of all keyword rules that have already been created. These can be deleted or edited.
- To create a new one, select Create New Item.
- Define the Name.
- Define one or more Keyword Queries.
- Click Query Syntax Help. For more information, refer to the ZyLAB One Search Language Guide.
Use predefined macros, like #amount# or #date#. A "Placeholder #xxx# is not found" message is shown when the macro is not recognized.
- Fuzzy finds variations of a term. Example: dutch~1 will find dutch, ditch, duich, durch, etc.
- Question Mark ? matches a single character. Example: wom?n will find woman and women.
- Asterisk * matches zero or more characters. Example: *most will find most and almost.
- AND finds related terms and narrows your search. Example: president AND america will only find files with both terms.
- OR finds one or all terms and broadens your search. Example: car OR transportation will find files with only the term car or transportation, and files with both terms.
- NOT allows you to exclude terms. Example: NOT apple will find all files that do not contain the term apple.
- TO finds occurrences of a term/query falling between two other terms/queries. Example: dear TO sincerely {John} will find the term John falling between dear and sincerely.
- Within finds related terms within a defined range. Example: Tom w/2 John will find the term Tom within two positions (terms) from John (Tom knows John because Tom is his brother).
- Precedes finds preceding terms within a defined range. Example: live p/2 work will find ‘live to work’ but not ‘work to live’.
- Number Range finds numbers or number ranges. Example: (>1 : <10) will find all values between 1 and 10.
- Quorum finds a specified number of terms from a list. Example: 1 of {blue, green, red} will find files that contain at least one of the colors from the list.
- Character Patterns [] matches a single character and/or range that is contained within the brackets. Example: m[a-z]n will find all terms with three letters that start with an m and end with an n.
- Negation [^] negates a single character and/or range that is contained within the brackets. Example: [^b]pple will find apple, but also –pple, cpple, dpple, etc., but not bpple.
- Preceding Element + matches the preceding element one or more times. Example: ab+c will find abc, abbc, abbbc, abbbbc, etc., but not ac.
- Preceding Element {m,n} matches the preceding element at least m and not more than n times. Example: ba{2,3}b will find baab and baaab, but not bab or baaaab.
- Preceding Element {m} matches the preceding element exactly m times. Example: [0-9]{3}-[0-9]{4} will find the local phone number 123-4567.
- Preceding Element {m,} matches the preceding element at least m times. Example: ba{0,}b will find bb, bab, baab, baaab, etc.
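Several of the operators above behave like familiar wildcard and regular-expression constructs, so they can be tried out in Python before a rule is created. The sketch below is only an approximation under stated assumptions: it uses Python's `fnmatch` and `re` modules, not the ZyLAB search engine, and the helper names (`wildcard_hits`, `pattern_hits`, `within`, `precedes`) are hypothetical. In particular, how ZyLAB counts positions for w/n and p/n is assumed here to be the difference in word index.

```python
import re
from fnmatch import fnmatchcase


def wildcard_hits(pattern, terms):
    """Terms matched in full by a ?/* wildcard pattern."""
    return [t for t in terms if fnmatchcase(t, pattern)]


def pattern_hits(pattern, terms):
    """Terms matched in full by a character-pattern expression."""
    return [t for t in terms if re.fullmatch(pattern, t)]


def _positions(text, term):
    """0-based word positions at which `term` occurs, ignoring case."""
    words = re.findall(r"\w+", text.lower())
    return [i for i, word in enumerate(words) if word == term.lower()]


def within(text, a, b, n):
    """True if `a` occurs at most n word positions from `b`, in either order."""
    return any(i != j and abs(i - j) <= n
               for i in _positions(text, a) for j in _positions(text, b))


def precedes(text, a, b, n):
    """True if `a` occurs before `b`, at most n word positions earlier."""
    return any(0 < j - i <= n
               for i in _positions(text, a) for j in _positions(text, b))


print(wildcard_hits("wom?n", ["woman", "women", "wombat"]))          # ['woman', 'women']
print(wildcard_hits("*most", ["most", "almost", "mostly"]))          # ['most', 'almost']
print(pattern_hits("ba{2,3}b", ["bab", "baab", "baaab", "baaaab"]))  # ['baab', 'baaab']
print(pattern_hits("[^b]pple", ["apple", "bpple", "cpple"]))         # ['apple', 'cpple']
print(within("Tom knows John", "Tom", "John", 2))                    # True
print(precedes("work to live", "live", "work", 2))                   # False
```

Fuzzy search, number ranges, and the TO operator have no direct one-line equivalent in these modules and are left out of the sketch.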
Use Case Example
Enron adds a new Keyword Highlighting rule to highlight terms relating to special category data, such as religious or political beliefs. This can be useful for finding sensitive personal data, which are listed under the GDPR.
It is essential to distinguish between these different sorts of personal data and to ensure that processing is justified (if such personal data are being collected and stored by your company).
The following queries can be used to find sensitive personal data, either by searching or by using keyword highlighting:
| Category | Query |
| --- | --- |
| Race / Ethnic Origin | 3 of {Black, white, Caucasian, Latino, Hispanic, Negroid, Capoid, Mongoloid, Australoid, Haitian, Asian, African American, American Indian, Alaska Native, Native Hawaiian, Pacific Islander} |
| Political Opinions | 3 of {Politics, left wing, right wing, monarchy, democracy, dictatorship, federation, confederation, supreme court, communism, corruption, social democracy, libertarianism, authority, Christian democracy, political freedom, anarchy, republic, conservative, political parties, political election, president, king, queen, vice president, dictator, socialism, anti-communism, revolution, legislation, regime, vote, voting, rights, representative democracy, parliament, parliamentary system, presidential system, constitutional democracy, prime minister, tyrant, tyranny} |
| Trade Union Membership | 3 of {collective bargaining agreement, collective agreement, collective labo?r agreement, International Trade Union Confederation, ITUC, Confédération Syndicale Internationale, Confederación Sindical Internacional, CSI, Building "and" Wood Workers International, BWI, Education International, EI, International Affiliation "of" Writers Guilds, IAWG, International Arts "and" Entertainment Alliance, IAEA, International Federation "of" Chemical, Energy, Mine "and" General Workers Unions, ICEM, International Federation "of" Journalists, IFJ, International Metalworkers Federation, IMF, International Transport Workers Federation, ITF, International Textile, Garment "and" Leather Workers Federation, ITGLWF, International Union "of" Food, Agricultural, Hotel, Restaurant, Catering, Tobacco "and" Allied Workers Association, IUF, Public Services International, PSI, Union Network International, UNI} |
| Genetic Data | 3 of {genetic, genetics, genetic data, genetic tests, genetic test, family medical history, medical history, genetic service* request, genetic service* receipt, fetus, embryo} |
| Biometric Data | 3 of {retina, nerve layer, thin layer, tissue, eye, sensory membrane, fingerprint, Finger*, Index, Forefinger, Thumb, Pointer, Plain Arch, Tented Arch, Ulnar loop, Radial loop, Double loop, Plain whorl, Central pocket loop whorl, Accidental whorl, faceprint, face, facial recognition, facial features, hand, hand palm, palm, simian crease, simian, crease, life line, head line, heart line, girdle "of" venus, sun line, mercury line, fate line, vein*, vein* pattern*, voice, voice recording, record, documentation, record-keeping, memo, voice memo} |
| Health Data | 3 of {Doctor, physician, physiotherapist, physiotherapy, health insurance, health, general practitioner, cancer, sick, ill, diseased, dead, death, morgue, cardiologist, surgeon, nurse, nursery, hospital, first aid, emergency room, patient, family physician, GP, pediatrician, ophthalmologist, company doctor, company physician, patient data, release form hospital, medical specialist, nursing home, absenteeism, sick leave, blood pressure, resuscitate, ambulance, trauma helicopter, forensic doctor, dermatologist, dietician, gynecologist, pregnancy, miscarriage, abortion, low blood pressure, high blood pressure, blood sugar, oncologist, diagnose, x-ray, medicine, prescription, pharmacy, clinic, intensive care, IC, healthy, obese, COVID-19, hernia, occupational physician, incapacitated, HIV, blood type, depression} |
| Sexual Orientation | 3 of {Hetero, homosexual, straight, gay, homo, bisexual, queer, transgender, intersex, pansexual, demisexual, asexual, androphilia, gynephilia, bi-curious, non-heterosexual, lesbian, gay sexual, trans, gender nonconforming, nonbinary, genderqueer, gender fluid, gender neutral, male assigned at birth, female assigned at birth, unassigned at birth, allosexual, skoliosexual, bi, omnisexual} |
| Criminal Past | 3 of {Judge, criminal, jail, prison, police, attorney, lawyer, defense attorney, court, courtroom, prosecutor, public defense, sentence, ruling, court clerk, law office, law, file number, file, legal decision, jurisprudence, suspect, victim, probation, parole, parole officer, prison sentence, community service, fine, appeal, higher appeal, supreme court, court "of" justice, court "of" law, court "of" appeal, arrest, custody, perjury, crime, felony, felon, violation, offence, public prosecutor, unconditional sentence, correctional institution, plea, probationary period, criminal record, charge, indictment, acquittal, punishment, misdemeanor, penalty, breach} |
| Religion | 3 of {God, Jehovah, Yahweh, Jew, Judaism, heaven, hell, Satan, lucifer, devil, angel, Beelzebub, Allah, prophet, apostle, sin, Christianity, Catholic, Jehovah witnesses, kingdom hall, church, Holy ghost, Bible, (Word w/2 God), demon, gospel, Watchtower, Christ, Jesus Christ, Quran, Qoran, archangel, Sunnah, Mecca, Torah, mosque, synagogue, Islam, Jannah, Jahannam, Ramadan, Easter, prayer, fasting, Jihad, Old testament, New testament, crucifixion, resurrection, baptism, Calvinist, Evangelical, Lutheran, Hail Mary, (Day w/3 Judgement), Armageddon, rabbi, Hinduism, Buddhism, buddha, Nirvana, rebirth, reincarnation, meditation, karma, Temple, Sikh, Buddhist, Hindu, Muslim, Christian, Jainism} |
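The queries above all rely on the Quorum operator ("3 of {...}"), which only flags a document when enough terms from the list occur together. The Python sketch below illustrates that idea under stated assumptions: the `quorum` function is a hypothetical helper, not part of ZyLAB; it counts distinct terms rather than total occurrences, matches whole words or phrases case-insensitively, and does not handle wildcards (such as Finger*) or nested proximity items (such as Word w/2 God).

```python
import re


def quorum(text, terms, minimum):
    """True if at least `minimum` distinct terms occur in `text`.

    Terms are matched as whole words or phrases, ignoring case.
    """
    found = set()
    for term in terms:
        pattern = r"\b" + re.escape(term) + r"\b"
        if re.search(pattern, text, flags=re.IGNORECASE):
            found.add(term.lower())
    return len(found) >= minimum


health_terms = ["patient", "surgeon", "hospital", "clinic", "sick leave"]

print(quorum("The patient saw a surgeon at the hospital.", health_terms, 3))  # True
print(quorum("The patient left early.", health_terms, 3))                      # False
```

Requiring three terms instead of one reduces hits on documents that mention a single common word, such as "face" or "file", in an unrelated context.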