"Evaluating Spülenposition gegen Wasseranschlussabstand" and "Analysierend die Platzierungskonflikte und Rohrleitungszwänge klären" are examples of generated summaries.
I wish someone could explain this - because I see no way that sentences like these would show up in training data. But maybe the RL for thinking has some quirks?
u know:
Thinking.... 1hr 3m - 89k tokens . thought 12s.