Where can I download the EMNIST dataset?

The dataset can be downloaded from the NIST website, or through the provided links to the Matlab and binary format files.

Is the EMNIST dataset free to use?

Yes, the EMNIST dataset is free for research and academic purposes.

EMNIST Dataset

Overview

The EMNIST dataset is a collection of handwritten characters and digits derived from NIST Special Database 19. The images are converted to a 28x28 pixel format that mirrors the structure of the original MNIST dataset. EMNIST offers six splits: ByClass, ByMerge, Balanced, Letters, Digits, and MNIST. These range from unbalanced character sets to balanced digit sets. The dataset is available in Matlab and binary formats, so it can be loaded by most machine learning frameworks. Its main value is in providing larger and balanced datasets for training and evaluating character recognition models. Researchers can use it to improve OCR systems and handwriting recognition software, and to develop new algorithms. It supports use cases ranging from basic digit classification to harder character differentiation tasks.

Common tasks

Handwritten Character Recognition
Digit Classification
OCR Model Training

FAQ

What is the EMNIST dataset?

The EMNIST dataset is a set of handwritten characters and digits derived from NIST Special Database 19 and converted to a 28x28 pixel image format.

What formats is the dataset available in?

The dataset is provided in two file formats: Matlab and a binary format compatible with the original MNIST dataset.
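Since the binary files follow the original MNIST layout, they can be read with a few lines of code. The sketch below assumes that layout is the IDX format used by MNIST (a 4-byte header with a data-type code and dimension count, followed by big-endian dimension sizes and raw unsigned bytes). The function name `parse_idx` is illustrative, and the input here is synthetic rather than a real EMNIST file.

```python
import struct

import numpy as np


def parse_idx(raw: bytes) -> np.ndarray:
    """Parse an IDX-format byte string into a NumPy array (uint8 data only)."""
    # Header: two zero bytes, a data-type code, and the number of dimensions.
    zero, dtype_code, ndim = struct.unpack(">HBB", raw[:4])
    if zero != 0 or dtype_code != 0x08:
        raise ValueError("expected an unsigned-byte IDX file")
    # Each dimension size is a 4-byte big-endian unsigned integer.
    dims = struct.unpack(f">{ndim}I", raw[4:4 + 4 * ndim])
    data = np.frombuffer(raw, dtype=np.uint8, offset=4 + 4 * ndim)
    return data.reshape(dims)


# Synthetic example: two 28x28 "images" with all pixels set to zero.
header = struct.pack(">HBBIII", 0, 0x08, 3, 2, 28, 28)
images = parse_idx(header + bytes(2 * 28 * 28))
print(images.shape)  # (2, 28, 28)
```

If the downloaded files are gzip-compressed, as the original MNIST files are, the bytes can be obtained with `gzip.open(path, "rb").read()` before calling the parser. Image orientation may differ from MNIST, so it is worth plotting a few samples before training.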

What are the different splits provided in the dataset?

The dataset includes six different splits: EMNIST ByClass, EMNIST ByMerge, EMNIST Balanced, EMNIST Letters, EMNIST Digits, and EMNIST MNIST.
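The splits differ mainly in how many classes they contain. The sketch below records the class counts as reported in the EMNIST paper (they are not stated on this page) and shows a label-to-character lookup for ByClass. The ordering used here (digits, then uppercase, then lowercase) is an assumption, and the names `SPLIT_CLASSES` and `byclass_label_to_char` are illustrative; the mapping files shipped with the dataset are the authoritative source.

```python
import string

# Class counts per split, taken from the EMNIST paper (assumption: not from this page).
SPLIT_CLASSES = {
    "ByClass": 62,   # 10 digits + 26 uppercase + 26 lowercase
    "ByMerge": 47,   # similar-looking upper/lowercase letters merged
    "Balanced": 47,  # same classes as ByMerge, equal samples per class
    "Letters": 26,   # case-merged letters only
    "Digits": 10,
    "MNIST": 10,
}

# Assumed ByClass label order: digits, uppercase letters, lowercase letters.
BYCLASS_CHARS = string.digits + string.ascii_uppercase + string.ascii_lowercase


def byclass_label_to_char(label: int) -> str:
    """Map a ByClass label (0-61) to its character under the assumed ordering."""
    if not 0 <= label < len(BYCLASS_CHARS):
        raise ValueError(f"label out of range: {label}")
    return BYCLASS_CHARS[label]


print(byclass_label_to_char(10))  # A
```

A model's output layer should match the class count of the chosen split, for example 47 units for Balanced or 10 for Digits.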

How should I cite the EMNIST dataset?

Please cite the following paper: Cohen, G., Afshar, S., Tapson, J., & van Schaik, A. (2017). EMNIST: an extension of MNIST to handwritten letters.

Compare with top alternatives

Tool	Pricing	Rating	Visits
EMNIST Dataset (current)	Free	-	-
PyOD	Free	★ 0.0	-
Pylint	Free	★ 0.0	-
Publuu	Freemium	★ 0.0	-
