Papers
arxiv:2605.24053

Breaking the Chains of Probability: Neutrosophic Logic as a New Framework for Epistemic Uncertainty in Large Language Models

Published on May 22
· Submitted by
Maikel Yelandi Leyva Vázquez
on May 28
Authors:

Abstract

Neutrosophic Logic is applied to large language models to better represent epistemic uncertainty and internal conflicts, revealing that hyper-truth states emerge spontaneously in ethical and logical contexts.

AI-generated summary

Large Language Models (LLMs) are predominantly governed by probabilistic frameworks in which the sum of outcome probabilities is constrained to unity. This architectural limitation, often imposed by Softmax layers, leads to a collapse of uncertainty that makes it difficult to differentiate between epistemic uncertainty, paradox, and vagueness. We present an empirical investigation of the application of Neutrosophic Logic, a framework that treats Truth (T), Indeterminacy (I), and Falsity (F) as three independent dimensions, to model epistemic states in LLMs. We conducted experiments on a family of four OpenAI GPT models across five linguistic phenomena: logical paradoxes, epistemic ignorance, vagueness, ethical contradictions, and future contingencies, under three prompting strategies: neutrosophic, probabilistic, and entropy-derived. Our findings reveal that the neutrosophic approach, by allowing T+I+F > 1, a state we term hyper-truth, provides a richer representation of a model's internal state. In 35% of evaluations, hyper-truth emerged spontaneously, predominantly under ethical contradiction and logical paradox. We demonstrate that this approach preserves truth values in fuzzy contexts and offers a robust method for identifying and quantifying internal model conflict. We conclude that the integration of neutrosophic evaluation layers is a critical step toward more transparent, reliable, and ethically aware AI systems.

Community

Paper author Paper submitter

TL;DR

This paper argues that the Softmax layer at the end of an LLM isn't just an implementation choice — it's an epistemological constraint. By forcing the probabilities of mutually exclusive outcomes to sum to 1, we collapse four distinct phenomena into the same format: ignorance, paradox, vagueness, and genuine moral conflict. The authors propose evaluating LLMs with neutrosophic logic (Truth, Indeterminacy, and Falsity as three independent dimensions in [0,1], with no sum constraint) and report that in 66% of unconstrained evaluations T + I + F > 1 — a regime they call hyper-truth that is structurally unreachable under probabilistic prompting (Proposition 1).

What I find interesting

The experiment is well controlled: same JSON output format under Strategy 1 and Strategy 2, the only thing that varies is the sum constraint communicated to the model. That cleanly isolates the effect of normalization from the effect of output format.
The largest representational gap appears exactly where it matters most for alignment: ethical contradiction (95% hyper-truth rate, ΔT = +0.267) and epistemic ignorance (ΔI = +0.383 — i.e., 38 percentage points of indeterminacy that normalization reroutes into confident assertions).
Mason (2026) independently replicated the effect across 5 additional vendors at 84%, so it doesn't look like an OpenAI-specific artifact.
The authors are explicit that this is a representational affordance, not a latent variable measured inside the model — I think that's the right framing and it avoids overclaiming.
Limitations worth flagging

Effective N = 100, with 5 stochastic replicates per cell — these are not independently sampled stimuli, so the Wilson CI and χ² should be read at the cell × repetition level.
Only 5 phenomena, one per class. Downstream validation is still open: does declared hyper-truth predict performance on trolley-problem reasoning, legal dilemma resolution, or paradox-tolerance benchmarks?
The plithogenic extension isn't pursued here, which is where Mason has been pushing and where Proposition 2 (non-injectivity of the scalar projection π) naturally points.
Question for the community

Does it make sense to think about a neutrosophic head as an optional post-Softmax evaluation layer — something a model could activate when it detects a domain sensitive to epistemic conflict (ethics, law, forecasting) — rather than replacing Softmax globally? Or does the audit/calibration cost of having two coexisting output formats make that impractical in production?

This is an automated message from the Librarian Bot. I found the following papers similar to this paper.

The following papers were recommended by the Semantic Scholar API

Please give a thumbs up to this comment if you found it helpful!

If you want recommendations for any Paper on Hugging Face checkout this Space

You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend

Sign up or log in to comment

Get this paper in your agent:

hf papers read 2605.24053
Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2605.24053 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2605.24053 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2605.24053 in a Space README.md to link it from this page.

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.