ML Scientist

Connecting Scholars with the Latest Academic News and Career Paths

AraFinNews: Largest Arabic Financial-News Summarisation Dataset Released

AraFinNews, the largest Arabic financial-news summarisation dataset to date, has been released. It includes 212,500 article–headline pairs from reputable financial media between 2015 and 2025.

The dataset is available on GitHub and HuggingFace, and the associated research paper can be found on arXiv. Highlights of the release:

  • Clean structured text and article-level metadata
  • Ready-to-use splits for summarisation and downstream financial NLP tasks
  • Evaluation of mT5, AraT5, and FinAraT5 models demonstrating gains from financial-domain pretraining

For more NLP datasets and tools, visit ArabicNLP.uk and explore the work of Dr Mo El-Haj at https://elhaj.uk and https://vinnlp.com.

Tags: Arabic NLP, Financial News Summarisation, AraFinNews Dataset, NLP Research, Natural Language Processing, AI/ML Dataset