Using MemAlign to Improve Evaluation of Traditional Machine Learning in Genie Code
Databricks leverages MemAlign to enhance the evaluation of Genie Code’s traditional ML code generation, enabling automated 9-dimensional scoring via LLM judges and significantly narrowing the gap with human experts.
入选理由:MemAlign 使 LLM 判官评分与人类专家一致性提升至 0.85 相关系数。