
Adversarial stylometry


Adversarial stylometry is the practice of altering writing style to reduce the potential for stylometry to discover the author's identity or their characteristics. This task is also known as authorship obfuscation or authorship anonymisation. Stylometry poses a significant privacy challenge because it can unmask anonymous authors or link pseudonyms to an author's other identities, which creates difficulties for whistleblowers and activists as well as for hoaxers and fraudsters. The privacy risk is expected to grow as machine learning techniques and text corpora develop.

All adversarial stylometry shares the core idea of faithfully paraphrasing the source text so that the meaning is unchanged but the stylistic signals are obscured. Such a faithful paraphrase is an adversarial example for a stylometric classifier. Several broad approaches exist, with some overlap: imitation, replacing the author's own style with another's; translation, applying machine translation in the hope that this eliminates characteristic style in the source text; and obfuscation, deliberately modifying a text's style so that it does not resemble the author's own.
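To make the idea of a paraphrase acting as an adversarial example concrete, here is a minimal Python sketch of the imitation approach. Everything in it is an assumption for illustration: the eight-word `FEATURE_WORDS` list, the nearest-profile cosine attribution, the author names, and the sample sentences are hypothetical. Real stylometric classifiers use far richer features and far more text.

```python
from collections import Counter
import math
import re

# Hypothetical feature set: pairs of near-interchangeable function words.
FEATURE_WORDS = ["whilst", "while", "upon", "on", "however", "but", "which", "that"]


def style_vector(text):
    """Relative frequency of each feature word in the text."""
    words = re.findall(r"[a-z']+", text.lower())
    counts = Counter(words)
    total = len(words) or 1
    return [counts[w] / total for w in FEATURE_WORDS]


def cosine(u, v):
    """Cosine similarity; 0.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def attribute(text, profiles):
    """Toy classifier: return the author whose profile is most similar."""
    vec = style_vector(text)
    return max(profiles, key=lambda name: cosine(vec, profiles[name]))


# Known writing samples for two hypothetical candidate authors.
profiles = {
    "author_a": style_vector(
        "Whilst the rain fell upon the roof, the dog which barked was silent, "
        "but the cat was not."
    ),
    "author_b": style_vector(
        "While the rain fell on the roof, the dog that barked was silent; "
        "however, the cat was not."
    ),
}

# An anonymous text written in author_a's habitual style.
original = ("The witness which spoke was calm, but the file upon the desk "
            "was gone whilst we waited.")

# The same content paraphrased to imitate author_b's function-word habits.
imitation = ("The witness that spoke was calm; however, the file on the desk "
             "was gone while we waited.")

print(attribute(original, profiles))   # author_a
print(attribute(imitation, profiles))  # author_b
```

The paraphrase keeps every content word and changes only the function words the toy classifier measures, which is why the attribution flips. An analysis built on different features would not necessarily be fooled.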

Manually obscuring style is possible, but laborious; in some circumstances, it is preferable or necessary. Automated tooling, either semi- or fully automatic, could assist an author. How best to perform the task and how to design such tools are open research questions. While some approaches have been shown to defeat particular stylometric analyses, particularly those that do not account for the possibility of an adversary, establishing safety against unknown analyses remains a problem. Ensuring the faithfulness of the paraphrase is a critical challenge for automated tools.
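As a rough illustration of what a faithfulness check in a semi-automatic tool might look like, the Python sketch below compares the content words of a source text and a candidate paraphrase. It is a deliberately crude lexical proxy, not a method from the literature described here: the `STYLE_WORDS` stop list and the example sentences are hypothetical, and the check cannot recognise synonyms or detect reordering that changes meaning. A practical tool would need a stronger semantic test.

```python
import re

# Hypothetical stop list: words treated as stylistic rather than content-bearing.
STYLE_WORDS = {
    "the", "a", "an", "of", "and", "to", "in", "on", "upon", "that", "which",
    "but", "however", "while", "whilst", "was", "is", "we", "it",
}


def content_words(text):
    """Set of lowercase words in the text that are not in the stop list."""
    words = re.findall(r"[a-z']+", text.lower())
    return {w for w in words if w not in STYLE_WORDS}


def content_overlap(source, paraphrase):
    """Jaccard overlap of content-word sets; 1.0 means the sets are identical."""
    src = content_words(source)
    par = content_words(paraphrase)
    if not src and not par:
        return 1.0
    return len(src & par) / len(src | par)


source = ("The witness which spoke was calm, but the file upon the desk "
          "was gone whilst we waited.")

# Changes only function words: every content word is preserved.
faithful = ("The witness that spoke was calm; however, the file on the desk "
            "was gone while we waited.")

# Also swaps "gone" for "found", which changes the meaning.
unfaithful = ("The witness that spoke was calm; however, the file on the desk "
              "was found while we waited.")

print(content_overlap(source, faithful))    # 1.0
print(content_overlap(source, unfaithful))  # 0.75
```

A tool could flag any candidate whose overlap falls below a chosen threshold for manual review, leaving the final judgement about meaning to the author.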

It is uncertain whether the practice of adversarial stylometry is itself detectable. Some studies have found that particular methods leave signals in the output text, but a stylometrist who does not know which methods may have been used may not be able to detect them reliably.


