QMIND - Queens AI Hub

Efficacy of K-Center Greedy Coresets, ProjectX 2nd Place
DAIR

ProjectX, Competitive, Research, Data Efficiency

SHORT PROJECT DESCRIPTION

This paper addresses data efficiency challenges in fine-tuning pre-trained Large Language Models (LLMs) by exploring coreset selection for Abstractive Text Summarization, an area with limited prior research. It uses the KCenterGreedy method to select training data, with the aim of optimizing both its quality and its quantity. The study compares algorithmically chosen coresets with random subsets across three datasets of varying sizes, evaluating summary quality and training time. It finds that KCenterGreedy can reduce computational costs, which offers a promising avenue for sustainable and cost-efficient LLM training. The code can be found here: https://www.kaggle.com/code/rababazeem/pubmed-kcentergreedy
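To make the comparison above concrete, here is a minimal sketch of k-center greedy selection next to a random baseline. It is an illustration under stated assumptions, not the project's implementation (see the Kaggle notebook for that): it assumes each training example has already been mapped to a fixed-size embedding vector, it uses Euclidean distance, and the function names `k_center_greedy` and `random_subset` are hypothetical.

```python
import numpy as np


def k_center_greedy(embeddings, k, seed=0):
    """Greedily pick k row indices; each new pick is the point farthest
    from all points picked so far (assumes Euclidean distance)."""
    embeddings = np.asarray(embeddings, dtype=float)
    n = embeddings.shape[0]
    if not 0 < k <= n:
        raise ValueError("k must be between 1 and the number of points")

    rng = np.random.default_rng(seed)
    first = int(rng.integers(n))  # arbitrary starting center
    selected = [first]
    chosen = np.zeros(n, dtype=bool)
    chosen[first] = True

    # Distance from every point to its nearest selected center so far.
    min_dist = np.linalg.norm(embeddings - embeddings[first], axis=1)

    while len(selected) < k:
        # Mask already-chosen points so duplicates are never re-selected.
        candidates = np.where(chosen, -np.inf, min_dist)
        nxt = int(np.argmax(candidates))
        selected.append(nxt)
        chosen[nxt] = True
        dist_to_new = np.linalg.norm(embeddings - embeddings[nxt], axis=1)
        min_dist = np.minimum(min_dist, dist_to_new)

    return selected


def random_subset(n, k, seed=0):
    """Baseline: k distinct indices drawn uniformly at random."""
    rng = np.random.default_rng(seed)
    return rng.choice(n, size=k, replace=False).tolist()


if __name__ == "__main__":
    # Toy stand-in for document embeddings: 200 random 16-dim vectors.
    toy = np.random.default_rng(1).normal(size=(200, 16))
    print("coreset indices:", k_center_greedy(toy, k=10))
    print("random indices: ", random_subset(len(toy), k=10))
```

In a fine-tuning setup, the returned indices would be used to subset the training set before training; the random baseline with the same `k` gives the comparison point for summary quality and training time.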

REAL WORLD IMPACT - What impact will this project have on the world of AI?

This project addresses the growing computational and environmental costs of fine-tuning Large Language Models (LLMs). As LLMs become more widely used, the demand for training resources increases, contributing to higher energy consumption and longer training times. By investigating coreset selection, particularly through the k-center greedy method (KCenterGreedy), this research offers a practical way to reduce these costs. Coreset selection aims to let models train on smaller, high-quality subsets of data while preserving performance, thereby reducing computational costs and energy consumption. This work contributes to more sustainable, efficient, and scalable AI practices, supporting environmentally responsible model development while maintaining the performance standards required for real-world applications.
