Why LoRA Isn't Enough for Domain Pretraining
When adapting a large language model to a new domain, LoRA is usually the first tool researchers reach for. It is fast, memory-efficient, and works remarkably well for instruction fine-tuning. But ...
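To make the trade-off concrete, here is a minimal sketch of the LoRA idea: the pretrained weight matrix stays frozen and only two small low-rank factors are trained. The dimensions, seed, and rank below are illustrative assumptions, not values from any particular model.

```python
import numpy as np

# Illustrative LoRA update: instead of training the full weight matrix W
# (d_out x d_in), train two low-rank factors B (d_out x r) and A (r x d_in)
# and compute the adapted forward pass with W @ x + B @ (A @ x).
d_out, d_in, r = 4096, 4096, 8  # example dimensions, not from a real model

rng = np.random.default_rng(0)
W = rng.standard_normal((d_out, d_in))       # frozen pretrained weight
A = rng.standard_normal((r, d_in)) * 0.01    # trainable low-rank factor
B = np.zeros((d_out, r))                     # zero-initialized, so the update starts at 0

full_params = d_out * d_in
lora_params = d_out * r + r * d_in
print(f"full: {full_params:,}  lora: {lora_params:,}  "
      f"trainable fraction: {lora_params / full_params:.4%}")

x = rng.standard_normal(d_in)
y = W @ x + B @ (A @ x)  # equals W @ x before any training, since B is zero
```

The point of the zero initialization of `B` is that the adapted model is exactly the pretrained model at step 0; training only ever moves it away from that starting point through the small factors.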
The basic setup involves the following steps: Raw data —> Tokenizer (input IDs) —> Model (logits) —> Post-processing —> Prediction.

Tokenizer

Transformers cannot process raw text input,...
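The pipeline above can be sketched end to end in a few lines. Everything here is a toy stand-in: the vocabulary, embedding table, and classifier head are made-up placeholders, not a real tokenizer or model, but the data flow (text to input IDs to logits to prediction) is the same.

```python
import numpy as np

# Toy sketch of: Raw data -> Tokenizer (input IDs) -> Model (logits)
# -> Post-processing -> Prediction. Vocabulary and weights are placeholders.
vocab = {"[UNK]": 0, "the": 1, "movie": 2, "was": 3, "great": 4, "awful": 5}

def tokenize(text):
    # Map each whitespace-split word to an input ID, falling back to [UNK].
    return [vocab.get(w, vocab["[UNK]"]) for w in text.lower().split()]

def softmax(z):
    z = z - z.max()          # subtract max for numerical stability
    e = np.exp(z)
    return e / e.sum()

rng = np.random.default_rng(0)
embed = rng.standard_normal((len(vocab), 8))  # token embedding table
head = rng.standard_normal((8, 2))            # 2-class classifier head

def predict(text, labels=("negative", "positive")):
    ids = tokenize(text)              # tokenizer: text -> input IDs
    h = embed[ids].mean(axis=0)       # "model": pool token embeddings
    logits = h @ head                 # model output: raw logits
    probs = softmax(logits)           # post-processing: logits -> probabilities
    return labels[int(probs.argmax())], probs

label, probs = predict("the movie was great")
print(label, probs.round(3))
```

A real setup would swap the toy pieces for a trained tokenizer and model, but the post-processing step (softmax plus argmax over class probabilities) is exactly what libraries do under the hood for classification.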
Generic Transformer Models

Encoder models: these models possess bidirectional attention and are often referred to as auto-encoding models. Training is performed on a perturbed input (by masking word...
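The masking perturbation can be sketched as follows. This is an assumption-laden simplification of the BERT-style objective: each token is independently replaced with a `[MASK]` symbol at some rate (15% is the common convention), and the positions that were masked become the reconstruction targets.

```python
import random

# Sketch of the masked-token perturbation used to train encoder
# (auto-encoding) models: a fraction of tokens is replaced with [MASK],
# and the model must reconstruct the originals at those positions.
def mask_tokens(tokens, mask_token="[MASK]", rate=0.15, seed=0):
    rng = random.Random(seed)
    masked, targets = [], {}
    for i, tok in enumerate(tokens):
        if rng.random() < rate:
            masked.append(mask_token)
            targets[i] = tok  # position -> original token the model must predict
        else:
            masked.append(tok)
    return masked, targets

tokens = "the quick brown fox jumps over the lazy dog".split()
masked, targets = mask_tokens(tokens)
print(masked)
print(targets)
```

Real implementations add refinements (e.g. sometimes replacing a chosen token with a random token instead of `[MASK]`), but the core perturb-and-reconstruct loop is the one shown here.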
The following are some sources of open-source data.

Popular open data repos:
- OpenML.org
- Kaggle.com
- PapersWithCode.com
- UC Irvine Machine Learning Repository
...