BioChemCalc

Image data Classification

Image data Classification is a tool that converts image data to grayscale, performs dimensionality
reduction using PCA and t-SNE, and clusters the data using K-means and HCD.

Please upload up to 50 images. Each image must be less than 400KB, and the filenames should follow this format: Class_Number (e.g., A_1.png, A_2.png, B_1.png).

Upload finish

Results:

TSNE image K-means image DBSCAN image HCD image

t-SNE & PCA Parameters

Parameter Value

What is Image Data Classification?

Image data classification is a technique that allows researchers to group, compare, and explore patterns within large sets of biological images. This tool supports key steps such as grayscale conversion, dimensionality reduction via PCA and t-SNE, and clustering using k-means or hierarchical clustering (HCD).

With this tool, users can upload up to 50 images, analyze them visually, and identify hidden structures in their data—without needing to install any software. Each image is automatically processed to ensure proper size and format, and interactive plots are generated to show cluster separation and sample relationships.

BioChemCalc’s classification tool is free, web-based, and easy to use, making it ideal for both researchers and educators working in biology, medicine, or data science.

Traditionally, researchers relied on manual classification of histological or cellular images based on visual observation. This tool allows for a shift to statistically driven image analysis using PCA and t-SNE, reducing subjective bias and enhancing reproducibility.

It is particularly useful for classifying image sets such as wild-type vs mutant vs treated conditions, and can reveal subtle internal variations or progression patterns that may be difficult to detect visually. Time-course changes can also be tracked quantitatively through cluster shifts.

In addition to visualizing data structure, this tool can uncover outlier images, assist in quality control, and serve as an educational aid in teaching biological data interpretation with machine learning concepts.