[γlo'sapi]

Natural Language Processing has never been this easy


What is glossAPI?

GlossAPI is an open-source infrastructure developed by the Open Technologies Alliance (GFOSS) to transform raw Greek text from public consultation, science, education, literature, and culture into clean, well-documented, AI-ready data. As the Greek language remains underrepresented in large-scale AI datasets, GlossAPI provides the tools and workflows needed to create high-quality linguistic resources that are openly accessible and fully reproducible.

The project builds a foundational infrastructure for Greek Natural Language Processing by combining a robust, modular processing pipeline with a strong commitment to open standards. The pipeline covers every stage of document processing: automated downloading, text extraction, section segmentation, classification, and annotation. It supports multiple file formats while preserving structure and metadata.
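To make the idea of a modular, staged pipeline concrete, here is a minimal Python sketch. It is not the glossAPI API: the `Document` class, the stage functions, and `run_pipeline` are hypothetical names chosen for illustration, and the segmentation and classification rules are toy stand-ins. Downloading and annotation are left out to keep the example short.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Document:
    """Hypothetical container for one document and its metadata."""
    source: str
    text: str = ""
    sections: List[str] = field(default_factory=list)
    labels: List[str] = field(default_factory=list)
    metadata: Dict[str, str] = field(default_factory=dict)


# A stage takes a document and returns the (updated) document.
Stage = Callable[[Document], Document]


def extract(doc: Document) -> Document:
    # Stand-in for text extraction: the "raw file" is already a string here.
    doc.text = doc.metadata.get("raw", "").strip()
    return doc


def segment(doc: Document) -> Document:
    # Toy segmentation: split on blank lines and drop empty pieces.
    doc.sections = [s.strip() for s in doc.text.split("\n\n") if s.strip()]
    return doc


def classify(doc: Document) -> Document:
    # Toy rule (an assumption): short sections are headings, the rest body text.
    doc.labels = ["heading" if len(s) < 30 else "body" for s in doc.sections]
    return doc


def run_pipeline(doc: Document, stages: List[Stage]) -> Document:
    # Apply the stages in order and record the last one in the metadata.
    for stage in stages:
        doc = stage(doc)
        doc.metadata["last_stage"] = stage.__name__
    return doc


raw = "Εισαγωγή\n\nΗ ελληνική γλώσσα υποεκπροσωπείται στα μεγάλα σύνολα δεδομένων."
doc = run_pipeline(
    Document(source="example.txt", metadata={"raw": raw}),
    [extract, segment, classify],
)
print(list(zip(doc.labels, doc.sections)))
```

Because every stage has the same signature, stages can be reordered, swapped, or skipped without touching the others, and the metadata travels with the document from start to finish.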

High-quality datasets produced by GlossAPI are already available on Hugging Face, enabling research, education, digital humanities, NLP applications, and the development of Greek language models. GlossAPI is also being used in European projects to improve the understanding and processing of the Greek language in real-world contexts.

Beyond a tool, GlossAPI is a community: researchers, developers, linguists, and students collaborate in an open, participatory, and ethically aligned ecosystem for Greek language technology. Our goal is to foster a sustainable, transparent, and collaborative environment for Greek NLP.

Whether you are training models, building smarter search systems, or exploring Greek digital heritage, GlossAPI provides the foundations to build scalable, transparent, and socially responsible AI applications.

All datasets are released under Creative Commons licenses, and the source code is openly available on GitHub.

Our Team

Prof. Petros Stefaneas

Scientific Advisor

Petros is scientifically responsible for GlossAPI, guiding the development of principled and reliable training material for NLP systems. His leadership ensures that GlossAPI not only processes Greek text with technical precision, but also upholds clarity, trustworthiness, and ethical integrity.

Scientific Consultation, Language Technology

Foivos Karounos

Chief Vibe Coder

Foivos Karounos studied Computer Science and Psychology and is interested in the development of the technological ecosystem in Greece. He has held roles in business strategy, cryptocurrency performance forecasting, and epistemology research. He is the lead Software Engineer of the glossAPI team.

Software Architecture, Project Management, Language Technologies

Nikos Tsekos

Software Engineer

Nikos Tsekos is an undergraduate Computer Engineering student and Software Engineer focused on machine learning applications. He currently works with GFOSS (Open Technologies Alliance) on the GlossAPI team, contributing to data pipelines, applied ML workflows, and open-source tools that improve access to scientific knowledge.

Computational Linguistics, NLP Systems, Machine Learning

Ioanna Moura

Linguistics Specialist

Ioanna Moura is a linguist and a trainee interpreter in Greek Sign Language (GSL). She completed her undergraduate studies in Greek Philology and her postgraduate studies in Language Technology at the National and Kapodistrian University of Athens. She has also worked at the National Hellenic Research Foundation in the field of archival research and was a researcher at Istorima. She joined the team at the Open Technologies Alliance (GFOSS) in April 2024.

Greek Philology, Language Technology, Greek Sign Language

Myrsini Ioannou

Software Engineer

Myrsini Ioannou studied Applied Mathematics and Physical Sciences at the National Technical University of Athens and holds a Master's degree in Sound and Music Computing from Universitat Pompeu Fabra in Barcelona. She has worked as a Data Scientist and joined the glossAPI team in March 2025 as a Software Engineer, where she focuses on the development and optimization of natural language processing technologies.

Applied Mathematics, Sound Computing, NLP Optimization

Join Our Community

GitHub

Check out our source code, contribute, and follow our development.


Hugging Face

Explore our datasets.


Discord

Join our community for discussions, support, and updates.
