Benchmark Evaluating "small" LLM performance for content extraction tasks calendar Apr 7, 2026 - clock 12 min read