List of datasets in computer vision and image processing information
Part of a series on
Machine learning and data mining
Paradigms
Supervised learning
Unsupervised learning
Online learning
Batch learning
Meta-learning
Semi-supervised learning
Self-supervised learning
Reinforcement learning
Curriculum learning
Rule-based learning
Quantum machine learning
Problems
Classification
Generative modeling
Regression
Clustering
Dimensionality reduction
Density estimation
Anomaly detection
Data cleaning
AutoML
Association rules
Semantic analysis
Structured prediction
Feature engineering
Feature learning
Learning to rank
Grammar induction
Ontology learning
Multimodal learning
Supervised learning (classification • regression)
Apprenticeship learning
Decision trees
Ensembles
Bagging
Boosting
Random forest
k-NN
Linear regression
Naive Bayes
Artificial neural networks
Logistic regression
Perceptron
Relevance vector machine (RVM)
Support vector machine (SVM)
Clustering
BIRCH
CURE
Hierarchical
k-means
Fuzzy
Expectation–maximization (EM)
DBSCAN
OPTICS
Mean shift
Dimensionality reduction
Factor analysis
CCA
ICA
LDA
NMF
PCA
PGD
t-SNE
SDL
Structured prediction
Graphical models
Bayes net
Conditional random field
Hidden Markov
Anomaly detection
RANSAC
k-NN
Local outlier factor
Isolation forest
Artificial neural network
Autoencoder
Cognitive computing
Deep learning
DeepDream
Feedforward neural network
Kolmogorov–Arnold Network
Recurrent neural network
LSTM
GRU
ESN
reservoir computing
Restricted Boltzmann machine
GAN
Diffusion model
SOM
Convolutional neural network
U-Net
Transformer
Vision
Mamba
Spiking neural network
Memtransistor
Electrochemical RAM (ECRAM)
Reinforcement learning
Q-learning
SARSA
Temporal difference (TD)
Multi-agent
Self-play
Learning with humans
Active learning
Crowdsourcing
Human-in-the-loop
RLHF
Model diagnostics
Coefficient of determination
Confusion matrix
Learning curve
ROC curve
Mathematical foundations
Kernel machines
Bias–variance tradeoff
Computational learning theory
Empirical risk minimization
Occam learning
PAC learning
Statistical learning
VC theory
Machine-learning venues
ECML PKDD
NeurIPS
ICML
ICLR
IJCAI
ML
JMLR
Related articles
Glossary of artificial intelligence
List of datasets for machine-learning research
List of datasets in computer vision and image processing
Outline of machine learning
v
t
e
This is a list of datasets for machine learning research. It is part of the list of datasets for machine-learning research. These datasets consist primarily of images or videos for tasks such as object detection, facial recognition, and multi-label classification.
and 20 Related for: List of datasets in computer vision and image processing information
Computervision tasks include methods for acquiring, processing, analyzing and understanding digital images, and extraction of high-dimensional data from...
parameters were shown to be comparable to vision transformers of similar size on ImageNet and similar image classification tasks. If a multilayer perceptron...
faster computersand big data, has led to achievements in speech recognition, object recognition inimages, image captioning, language translation and world...
A vision transformer (ViT) is a transformer designed for computervision. A ViT breaks down an input image into a series of patches (rather than breaking...
Proximal policy optimization (PPO) is an algorithm in the field of reinforcement learning that trains a computer agent's decision function to accomplish difficult...
its characteristics. Words, phrases, or entire documents, andimages, audio, and other types of data can all be vectorized. These feature vectors may be...
Can It Be? Estimating the Difficulty of Visual Search in an Image". 2016 IEEE Conference on ComputerVisionand Pattern Recognition (CVPR) (PDF). pp. 2157–2166...
used for applications such as computervisionand natural language processing, originally developed by Meta AI and now part of the Linux Foundation umbrella...
Institute of Standards and Technology database) is a large database of handwritten digits that is commonly used for training various imageprocessing systems...
bootstrap, and out-of-bag datasets. Each section below will explain how each dataset is made except for the original dataset. The original dataset is whatever...
as noise, shadowing, and variations in cameras, traditional hard clustering is often unable to reliably perform imageprocessing tasks as stated above...
recognition model developed by Hinton et al, the ReLU used in the 2012 AlexNet computervision model andin the 2015 ResNet model. Aside from their empirical performance...
compute for datasets larger than a couple of thousand examples without parallel processing. Kernel methods owe their name to the use of kernel functions...
Vision transformers adapt the transformer to computervision by breaking down input images as a series of patches, turning them into vectors, and treating...
instead of a flat result. In 1972, Robert F. Ling published a closely related algorithm in "The Theory and Construction of k-Clusters" in The Computer Journal...
copies of existing data. Synthetic Minority Over-sampling Technique (SMOTE) is a method used to address imbalanced datasetsin machine learning. In such...
normalize the range of independent variables or features of data. In data processing, it is also known as data normalization and is generally performed...