Appendix C: On Learning from User Feedback

B.1 The Intuition

FallacyTag invites feedback. When a user marks a tag as “incorrect,” that judgment is recorded. In theory, such input could help the system improve—by refining detection strategies, adjusting thresholds, or adapting to context over time. The intuition is familiar: learn from disagreement.

But in practice, learning from this kind of feedback is much harder than it looks.

B.2 Why Feedback Is Ambiguous

Not all rejections are equal—and not all are about reasoning. A user might dismiss a tag because:

It feels wrong for the tone or genre (e.g., casual podcast vs. formal debate)
They don’t like the interruption
They agree with the conclusion, and mistake critique for contradiction

In reasoning contexts, the line between “this is a fallacy” and “this rubs me the wrong way” is thin. Even sincere feedback can mislead if the signal is entangled with sentiment, ideology, or attention mode.

Beyond ambiguity, there’s distortion. In public contexts, feedback is shaped by:

Confirmation bias — users tend to protect reasoning they agree with
Social signaling — rejections often reflect tribal alignment, not logical appraisal
Moral loading — tags may be read as ideological attacks, not structural cues

In polarized environments, these effects can dominate. Tags get rejected not because they’re wrong, but because they’re unwelcome. In such settings, FallacyTag may not be misfiring—it may simply be misfit.

B.4 Strategies Considered (and Their Limits)

We explored multiple feedback-driven improvement strategies:

Strategy	Idea	Limitation
Rejection Clustering	Flag tags that get dismissed often	May reinforce local bias or majority opinion
Contextual Caching	Tune logic by media format or genre	Requires strong priors; risks overfitting
Prompt Chain Feedback	Feed rejections back into generation chains	Adds complexity without clear gains
Revision Tracing	Look for edits following dismissals	Sparse and difficult to correlate with feedback meaningfully
Cohort Modeling	Segment users by behavior for weighting	Requires scale and raises privacy/framing questions

Each approach offered a partial signal. None yielded a robust, general-purpose improvement loop.

B.5 Why the Loop Is Open—For Now

FallacyTag currently stores feedback but does not act on it. This is intentional.

In a system designed to support reasoning—not just user preference—learning from ambiguous or adversarial signals may do more harm than good. The system might shift toward comfort, not clarity.

The right path may involve:

Better modeling of user intent
New interaction patterns that disambiguate feedback
Feedback that arises from editing behavior, not just button taps

Until then, restraint is a feature, not a limitation. The system listens—but it does not contort.