Methodology
DebateMetrics condenses language-observable patterns in Bundestag debates into two metrics: Discourse Quality (DQ) and Rhetorical Behaviour (RB).
Source material consists of plenary protocols of the German Bundestag. Each sitting of the Bundestag is recorded as a plenary protocol / stenographic report and made publicly available by the Bundestag, generally as PDF or XML files. DebateMetrics extracts and processes this material for analysis; resulting text segments, metadata corrections, annotations, and scores are not official Bundestag publications.
The scores do not say whether a political position is correct. They describe how a fraction argues, structures claims, uses evidence, and engages with other positions in the detected speech contributions.
DQ summarizes how clear, substantive, topic-relevant, and argumentatively traceable contributions are.
RB summarizes how respectful, clear, and cooperative contributions are and how strongly attacks, avoidance, or polemics appear.
LLMs are used here as consistent, independent annotators. The prompts forbid external fact checking and content truth evaluation so that comparable language patterns are scored instead of political agreement.
Du bist ein unabhängiger, unparteiischer Politikwissenschaftler.
All DQ dimensions range from 0.0 to 1.0. The aggregated DQ score is the contribution-length weighted average of the nine dimensions.
DQ = weighted average of 9 dimensions, weighted by contribution length.RB also uses scores from 0.0 to 1.0. Problematic dimensions are inverted before averaging so higher total scores always mean better rhetorical behaviour.
RB = weighted average of 1 minus attack, 1 minus aggression, respect, 1 minus avoidance, 1 minus polemics, clarity, and cooperation.The filter view compares fractions across selected transcripts, chapters, and providers. The details view shows timelines, provider differences, and the evidence behind individual scores.