🏢
🌐 Entity
Truthfulness
Topics referred to by the same term
📊 Rating
1 news mentions · 👍 0 likes · 👎 0 dislikes
📌 Topics
- Machine Learning Safety (1)
- Language Model Evaluation (1)
- Unsupervised Learning (1)
🏷️ Keywords
Unsupervised Elicitation (1) · Language Models (1) · Easy-to-Hard Generalization (1) · Model Safety (1) · Evaluation Challenges (1) · Truthfulness (1) · AI Research (1)
📖 Key Information
Truthfulness may refer to:
Honesty—a moral character of a human being, related to telling the truth
Accuracy—the propensity of information to be correct
Incentive compatibility—a property of some strategic games that encourages participants to be honest about their preferences
See also:
Truth - a concept most often used to mean in accord with fact or reality.
Truthiness - a quality characterizing a "truth" that a person making an argument or assertion claims to know intuitively.
Truthlikeness - a philosophical concept that distinguishes between the relative and apparent truth and falsity of assertions and hypotheses.
📰 Related News (1)
-
🇺🇸 Three Concrete Challenges and Two Hopes for the Safety of Unsupervised Elicitation
arXiv:2602.20400v1 Announce Type: cross Abstract: To steer language models towards truthful outputs on tasks which are beyond human capability, previ...