Home
Latest
Featured
Tags
Search
Search for Blog
Language model performance
2
Large Language Models
Factual Accuracy
QA Dataset Design
Knowledge Retrieval
Language Model Performance
•
4 Jul, 2024
Rethinking QA Dataset Design: How Popular Knowledge Enhances LLM Accuracy
By
Casey Madison
Large Language Models
Computer Science
CS-Bench
AI Evaluation
Language Model Performance
•
22 Jun, 2024
Beyond Boundaries: Evaluating Large Language Models in Computer Science with CS-Bench
By
Avery Parks