Can Large Language Models (or Humans) Disentangle Text?

de Pieuchon, Nicolas Audinet; Daoud, Adel; Jerzak, Connor Thomas; Johansson, Moa; Johansson, Richard

arXiv:2403.16584 (cs)

[Submitted on 25 Mar 2024 (v1), last revised 3 May 2024 (this version, v2)]

Abstract: We investigate the potential of large language models (LLMs) to disentangle text variables, that is, to remove the textual traces of an undesired, forbidden variable, a task sometimes known as text distillation and closely related to the fairness in AI and causal inference literature. We employ a range of LLM approaches in an attempt to disentangle text by identifying and removing information about a target variable while preserving other relevant signals. We show that in the strong test of removing sentiment, the statistical association between the processed text and sentiment is still detectable to machine learning classifiers post-LLM-disentanglement. Furthermore, we find that human annotators also struggle to disentangle sentiment while preserving other semantic content. This suggests there may be limited separability between concept variables in some text contexts, highlighting limitations of methods relying on text-level transformations and also raising questions about the robustness of disentanglement methods that achieve statistical independence in representation space.
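
The evaluation idea described in the abstract, checking whether a classifier can still detect sentiment in text after it has been rewritten, can be illustrated with a small probe. The sketch below is not the authors' pipeline: it assumes a bag-of-words naive Bayes probe with leave-one-out evaluation, and the function names (`train_nb`, `predict_nb`, `probe_accuracy`, `majority_baseline`) and the toy sentences are hypothetical, chosen only for illustration.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase whitespace tokenizer (a deliberately simple assumption)."""
    return text.lower().split()


def train_nb(texts, labels):
    """Fit a multinomial naive Bayes model: per-class word counts, class counts, vocabulary."""
    word_counts = {}
    class_counts = Counter(labels)
    vocab = set()
    for text, label in zip(texts, labels):
        counts = word_counts.setdefault(label, Counter())
        for tok in tokenize(text):
            counts[tok] += 1
            vocab.add(tok)
    return word_counts, class_counts, vocab


def predict_nb(model, text):
    """Return the most probable label, using add-one smoothing over the training vocabulary."""
    word_counts, class_counts, vocab = model
    total = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label in sorted(class_counts):
        counts = word_counts[label]
        denom = sum(counts.values()) + len(vocab)
        score = math.log(class_counts[label] / total)
        for tok in tokenize(text):
            if tok in vocab:  # tokens unseen in training are ignored
                score += math.log((counts[tok] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


def probe_accuracy(texts, labels):
    """Leave-one-out accuracy of the probe on (text, label) pairs."""
    correct = 0
    for i in range(len(texts)):
        model = train_nb(texts[:i] + texts[i + 1:], labels[:i] + labels[i + 1:])
        correct += predict_nb(model, texts[i]) == labels[i]
    return correct / len(texts)


def majority_baseline(labels):
    """Accuracy obtained by always predicting the most frequent label."""
    return max(Counter(labels).values()) / len(labels)


if __name__ == "__main__":
    # Hypothetical "rewritten" reviews paired with the sentiment of the originals.
    rewritten = [
        "the meal arrived quickly and we finished every plate",
        "the staff helped us find a table and we stayed late",
        "the meal arrived late and we left most of it",
        "the staff ignored us and we left early",
    ]
    sentiment = ["pos", "pos", "neg", "neg"]
    print("probe accuracy:", probe_accuracy(rewritten, sentiment))
    print("majority baseline:", majority_baseline(sentiment))
```

Under this setup, probe accuracy near the majority baseline would be consistent with the target variable having been removed, while accuracy clearly above the baseline indicates that traces of it remain in the rewritten text. A real evaluation would need far more data and a stronger classifier than this toy probe.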

Comments:	To appear as: Nicolas Audinet de Pieuchon, Adel Daoud, Connor T. Jerzak, Moa Johansson, Richard Johansson. Can Large Language Models (or Humans) Disentangle Text? In: Sixth Workshop on NLP and Computational Social Science at NAACL, 2024
Subjects:	Computation and Language (cs.CL)
MSC classes:	68T50
ACM classes:	I.2.7; H.1.2
Cite as:	arXiv:2403.16584 [cs.CL]
	(or arXiv:2403.16584v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2403.16584

Submission history

From: Connor Jerzak
[v1] Mon, 25 Mar 2024 09:51:54 UTC (55 KB)
[v2] Fri, 3 May 2024 14:04:19 UTC (80 KB)
