Metagente

This repository contains the source code implementation of metagente and the datasets used to replicate the experimental results

Getting Started

Dependencies

Python version 3.10.12
Python packages are listed in requirements.txt
MongoDB

Running the application

Installing necessary packages:

pip install -r requirements.txt

Running the optimization code:

python main.py
--train_data_file data/train_data.csv
--train_result_dir result/train

Running the evaluation code:

python evaluation.py
--test_data_file data/test_data.csv
--test_result_dir result/test

Repositories structure

This repository contains the source code, datasets, and results for the experiments described in our paper. The structure of the project is as follows:

Data

This folder contains the input datasets used in the experiments.

ES.csv: Main dataset used for the experiments.
TS10.csv: Subset of the dataset used for testing with 10 samples.
TS50.csv: Subset of the dataset used for testing with 50 samples.

Results

This folder contains the outputs generated during the experiments.

GITSUM_TS10.txt: Results generated by the GITSUM model for the TS10 dataset.
GITSUM_TS50.txt: Results generated by the GITSUM model for the TS50 dataset.
LLAMA_TS10.csv: Summary results from the LLAMA model for the TS10 dataset.
LLAMA_TS50.csv: Summary results from the LLAMA model for the TS50 dataset.
METAGENTE: This folder contains parallel and sequential results from metagente for TS10 dataset. Moreover, Statistical tests are provided.

Tools

This folder contains the Python scripts used to run the experiments and generate results.

GITSUM.py: Script for running the GITSUM model on the datasets.
LLAMA_SUMMARY.py: Script for processing and summarizing results from the LLAMA model.

How to cite

If you use the tool or the dataset in your research, please cite our work using the following BibTex entries:

@inbook{10.1145/3696630.3728511,
   author = {Nguyen, Duc S. H. and Truong, Bach G. and Nguyen, Phuong T. and {Di Rocco}, Juri and {Di Ruscio}, Davide},
   title = {Teamwork makes the dream work: LLMs-Based Agents for GitHub README.MD Summarization},
   year = {2025},
   isbn = {9798400712760},
   publisher = {Association for Computing Machinery},
   address = {New York, NY, USA},
   url = {https://doi.org/10.1145/3696630.3728511},
   abstract = {The proliferation of Large Language Models (LLMs) in recent years has realized many applications in various domains. Being trained with a huge of amount of data coming from    various sources, LLMs can be deployed to solve different tasks, including those in Software Engineering (SE). Though they have been widely adopted, the potential of using LLMs cooperatively has not been thoroughly investigated.In this paper, we proposed Metagente as a novel approach to amplify the synergy of various LLMs. Metagente is a Multi-Agent framework based on a series of LLMs to self-optimize the system through evaluation, feedback, and cooperation among specialized agents. Such a framework creates an environment where multiple agents iteratively refine and optimize prompts from various perspectives. The results of these explorations are then reviewed and aggregated by a teacher agent. To study its performance, we evaluated Metagente with an SE task, i.e., summarization of README.MD files, and compared it with three well-established baselines, i.e., GitSum, LLaMA-2, and GPT-4o. The results show that our proposed approach works efficiently and effectively, consuming a small amount of data for fine-tuning but still getting a high accuracy, thus substantially outperforming the baselines. The performance gain compared to GitSum, the most relevant benchmark, ranges from 27.63\% to 60.43\%. More importantly, compared to using only one LLM, Metagente boots up the accuracy to multiple folds.},
   booktitle = {Proceedings of the 33rd ACM International Conference on the Foundations of Software Engineering},
   pages = {621–625},
   numpages = {5}
}

and

@article{Nguyen:2026:AUSE:Metagente,
   doi = {10.1007/s10515-025-00588-4},
   url = {https://doi.org/10.1007/s10515-025-00588-4},
   year = 2026,
   month = {feb},
   publisher = {Springer Science and Business Media {LLC}},
   author = {Nguyen, Duc S. H. and Nguyen, Minh T. and Nguyen, Phuong T. and {Di Rocco}, Juri and {Di Ruscio}, Davide},
   title = {Automated summarization of software documents: an LLM-based multi-agent approach},
   journal = {Automated Software Engineering}
}

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
AUSE		AUSE
Baselines		Baselines
Data		Data
METAGENTE		METAGENTE
Results		Results
.gitignore		.gitignore
Introduzione_AI_per_studenti_superiori.pptx		Introduzione_AI_per_studenti_superiori.pptx
README.md		README.md
copilot-example.py		copilot-example.py
copilot-example.text		copilot-example.text
test.py		test.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Metagente

Getting Started

Dependencies

Running the application

Repositories structure

Data

Results

Tools

How to cite

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Metagente

Getting Started

Dependencies

Running the application

Repositories structure

Data

Results

Tools

How to cite

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages