


What if the most labor-intensive part of creating a research manuscript, such as creating publication-ready figures, could be accomplished in a matter of minutes? Such is the promise of PaperBanana. Created by researchers from Peking University and Google Cloud AI Research, PaperBanana is an AI-based, agentic system that automatically converts scientific text into professional-quality academic figures. Such a breakthrough for students, researchers, and academics in higher education is a paradigm shift in the visualization of scientific knowledge.
In the midst of the rapid evolution of AI-based tools in higher education, PaperBanana is a very specific tool for research visualization. Unlike most AI-based tools, PaperBanana does not employ a single AI agent to accomplish a specific task. Instead, the system employs a cohort of five AI agents to accomplish the tasks of reference gathering, planning, style adjustment, rendering, and self-evaluation of the output. Such figures are of the quality of, if not superior to, those created by human experts.
Key Takeaway: PaperBanana achieved a 72.7% win rate against baseline AI methods in blind human evaluation, with its figures rated superior in conciseness, readability and aesthetics (Zhu et al., 2026).
PaperBanana is an open-source agent-based system that is developed to facilitate the automation of the production of publication-standard illustrations for academic publications. The system was developed by Dawei Zhu et al. in January 2026 and is available on arXiv under the identifier 2601.23265. The system is a solution to the widely recognized problem of the time-consuming production of methodology illustrations and statistical figures for academic publications.
Unlike most image generation systems, the PaperBanana system is based on a multi-agent approach that is supported by the latest vision and language technology. The system operates in two different modes:

Phase 1 – Linear Planning: In this phase, three specific agents are involved.
Phase 2 – Iterative Refinement: In this phase, two agents are involved.
The researchers created a benchmark called PaperBananaBench to test this framework. This benchmark consists of 292 test cases that are created from NeurIPS publications from 2025. This is one of the first tests created to assess AI-generated diagrams for academic papers. This test covers four different domains: Agent & Reasoning, Vision & Perception, Generative & Learning, and Science & Applications.
The implications of the use of artificial intelligence for scientific investigation go beyond the realm of convenience. “PaperBanana” represents a new category of scientific visualization software that could fundamentally change the way knowledge is disseminated throughout the entire academic community.
For graduate students, it can take weeks to develop the figures required for theses and publications. Students often do not have the luxury of using professional design software or training in the field. Using “PaperBanana,” a student can enter a description of his/her methodology and create a professionally designed diagram with modern color schemes, layouts, and typography appropriate for the academic community, which could otherwise require extensive manual labor using software such as Adobe Illustrator or PowerPoint.
Did You Know? PaperBanana scored 80.7% in conciseness – significantly outperforming even human experts (50.0%) – meaning it produces cleaner, more focused diagrams with less visual clutter.
Active researchers continually face the pressure of publishing, which can cause delays in the authoring process due to issues in the creation of figures. PaperBanana’s prototyping capability allows researchers to create multiple visual representations of the same methodologies, communicate effectively, and conduct various experiments. It also includes the generation of statistical plots, which can be done by producing Python Matplotlib codes to ensure numerical precision in the data-based images created by the researchers.
Researchers can utilize PaperBanana as a tool to aid their students in their research and as a means to teach them the principles of efficient scientific communication. By comparing the images created by the students and the images created by the AI, the students can be taught the principles of efficient scientific communication. PaperBanana can also be used to improve the images created by the students, as it can enhance the images created by the students by increasing their aesthetic appeal by up to 56.2% in 292 cases, as demonstrated by the framework’s performance in the test cases.
Researchers can utilize PaperBanana as it can be integrated into the research writing process. The researchers can input the text of the methodology section of the research and the corresponding figure’s caption, and the framework can automatically retrieve the necessary reference styles, plan the content of the images, follow the academic design standards, and create the images, all of which can be done without the researchers having to know the principles of design.
The framework can create images of methodologies, which can be done by creating complex diagrams, architectural diagrams, pipeline diagrams, and other diagrams necessary in the publication of AI and computer science-related research. PaperBananaBench demonstrated that the framework can achieve a score of 60.2, which is higher than the human reference score of 50.0. The framework performed the best in the creation of Agent and Reasoning diagrams, which can achieve up to 69.9%, while the lowest performance was seen in the creation of Vision and Perception diagrams, which can achieve up to 52.1%.
PaperBanana performance compared to human experts across five quality dimensions (data from PaperBananaBench; Zhu et al., 2026)
In addition to the creation of new figures, PaperBanana also has the ability to improve existing diagrams. This is because the aesthetic guidelines provided by the Stylist Agent could be used to improve the diagrams created manually. From the results of the empirical tests conducted, it was evident that the enhancement of the diagrams made them look better in 56.2% of cases. This is an indication that the tool could be useful in providing researchers with AI assistance in their work.
PaperBanana is part of an emerging sector of AI tools for academic research that are transforming universities around the world. From AI writing assistants to literature review tools, the academic research landscape is increasingly being augmented by AI tools. The agent-based approach, which uses several specialized AI agents to work together on tasks, is at the cutting edge of this revolution.
The future of AI in universities is therefore expected to be characterized by these tools becoming an integral part of the research landscape, similar to the current use of statistical software or reference managers. However, their effective integration into academic research must be informed by clear policies.
The PaperBanana team recognizes that one of the biggest risks in the use of AI tools in research is “visual hallucination,” in which illustrations created by the tool may include details that are not supported by the source material. The most common errors include “fine-grained connectivity” errors, which include “misaligned arrows” as well as “flow direction” errors (Zhu et al., 2026).
This is an indication that there is a need for researchers to be cautious when using AI tools in their work. The fact that the most common errors are those involving “fine-grained connectivity” is an indication that AI visualization tools must be used in collaboration with human experts.
Important: PaperBanana’s faithfulness score (45.8%) remains below the human reference (50.0%). This means human review for factual accuracy is still essential before any AI-generated figure is published.
The PaperBanana framework is open-source and available on GitHub. This provides an opportunity for universities to explore, adapt, and extend this technology. Some opportunities that could be taken in the future include:
Gulf University understands that artificial intelligence for students and researchers is no longer a promise of the future, but rather an undeniable reality. As the world over tries to figure out the best way to incorporate artificial intelligence into academic scholarship, Gulf University is committed to leading the digital revolution in education.
Innovative research frameworks like PaperBanana are the opportunities that our academic community can benefit from as well as contribute to. Gulf University can assess these new AI technologies, create guidelines on their responsible use, and be at the forefront of the digital revolution in higher education.
We thus strongly encourage our faculty, researchers, and students to explore the opportunities offered by AI-powered research visualization tools, critically, responsibly, and with the utmost commitment to academic integrity.
The emergence of AI in academic research poses interesting questions, which we think it’s essential to discuss within our academic community:
Your input matters. We would love to hear from you. You can share your thoughts in the comments section, or get in touch with the Gulf University Research & Innovation Office to discuss the opportunities offered by AI-powered tools to help you achieve your research objectives.
The future of scientific communication is being written today. Whether you’re a student embarking on your very first research project, a faculty member helping shape the next generation of academics, or an established researcher pushing the frontiers of knowledge, AI-powered research tools like PaperBanana can help you improve the quality of your work.
Ready to explore? Visit the PaperBanana project page at https://dwzhu-pku.github.io/PaperBanana/ to learn more about the framework and access the open-source code. Stay tuned for upcoming Gulf University workshops on AI tools for research.
At Gulf University, innovation happens through inquiry. This article advocates for advancing the future of artificial intelligence in higher education through a framework that incorporates responsibility, ethical considerations, and academic standards of excellence that are characteristic of high-quality scholarship.
Prof. Mohanad Alfiras
Gulf University
Last Updated: 09 Apr 2026