SC-Arena: A Natural Language Benchmark for Single-Cell Reasoning with Knowledge-Augmented Evaluation
#SC-Arena #Large Language Models #Single-cell biology #Evaluation benchmark #Knowledge-augmented evaluation #Virtual cell #Natural language tasks #Biological reasoning
📌 Key Takeaways
- SC-Arena introduces a unified evaluation framework for LLMs in single-cell biology
- The framework uses a 'virtual cell' abstraction to represent cellular attributes and interactions
- Five natural language tasks probe core reasoning capabilities in cellular biology
- Knowledge-augmented evaluation incorporates external biological knowledge for more accurate assessment
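As a rough illustration of how the 'virtual cell' abstraction and knowledge-augmented evaluation might fit together, here is a minimal Python sketch. It is not SC-Arena's actual implementation: the `VirtualCell` fields, the `KNOWLEDGE_BASE` dictionary, and the `knowledge_augmented_score` function are all hypothetical names chosen for this example, and the scoring rule (the fraction of reference marker genes that a model's answer mentions) is an assumed stand-in for whatever metric the benchmark really uses.

```python
import re
from dataclasses import dataclass, field


@dataclass
class VirtualCell:
    """Hypothetical record of cellular attributes (field names are illustrative)."""
    cell_type: str
    tissue: str
    marker_genes: list = field(default_factory=list)


# Toy stand-in for an external biological knowledge source (an assumption,
# not the resource used by the benchmark): cell type -> known marker genes.
KNOWLEDGE_BASE = {
    "T cell": {"CD3D", "CD3E", "CD2"},
    "B cell": {"CD79A", "MS4A1", "CD19"},
}


def knowledge_augmented_score(answer: str, cell: VirtualCell, kb: dict) -> float:
    """Return the fraction of reference marker genes that the answer mentions.

    The answer is split into alphanumeric tokens so that a gene symbol only
    matches as a whole word (e.g. "CD2" does not match inside "CD247").
    """
    reference = kb.get(cell.cell_type, set())
    if not reference:
        return 0.0  # no external knowledge available for this cell type
    tokens = set(re.findall(r"[A-Za-z0-9]+", answer.upper()))
    return len(reference & tokens) / len(reference)


cell = VirtualCell(cell_type="T cell", tissue="blood", marker_genes=["CD3D"])
score = knowledge_augmented_score("This cell expresses CD3D and CD3E.", cell, KNOWLEDGE_BASE)
print(f"{score:.2f}")
```

The point of the sketch is only that a free-text answer is checked against external reference knowledge rather than against a single gold string; a real evaluator would presumably draw on far richer knowledge sources and judgment criteria.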
🏷️ Themes
Artificial Intelligence, Scientific Research, Evaluation Methods
📚 Related People & Topics
Large language model
Type of machine learning model
A large language model (LLM) is a language model trained with self-supervised machine learning on a vast amount of text, designed for natural language processing tasks, especially language generation. The largest and most capable LLMs are generative pre-trained transformers (GPTs) that provide the c...
Cellular model
A cellular model or virtual cell is a computational model of aspects of a biological cell, for the purposes of in silico research. Developing such models has been a task of systems biology and mathematical biology. It involves developing efficient algorithms, data structures, visualization and commu...