Semiotic phenomenon of large language models (LLMs)
Part of a series on
Machine learning and data mining
Paradigms
Supervised learning
Unsupervised learning
Online learning
Batch learning
Meta-learning
Semi-supervised learning
Self-supervised learning
Reinforcement learning
Curriculum learning
Rule-based learning
Quantum machine learning
Problems
Classification
Generative modeling
Regression
Clustering
Dimensionality reduction
Density estimation
Anomaly detection
Data cleaning
AutoML
Association rules
Semantic analysis
Structured prediction
Feature engineering
Feature learning
Learning to rank
Grammar induction
Ontology learning
Multimodal learning
Supervised learning (classification • regression)
Apprenticeship learning
Decision trees
Ensembles
Bagging
Boosting
Random forest
k-NN
Linear regression
Naive Bayes
Artificial neural networks
Logistic regression
Perceptron
Relevance vector machine (RVM)
Support vector machine (SVM)
Clustering
BIRCH
CURE
Hierarchical
k-means
Fuzzy
Expectation–maximization (EM)
DBSCAN
OPTICS
Mean shift
Dimensionality reduction
Factor analysis
CCA
ICA
LDA
NMF
PCA
PGD
t-SNE
SDL
Structured prediction
Graphical models
Bayes net
Conditional random field
Hidden Markov
Anomaly detection
RANSAC
k-NN
Local outlier factor
Isolation forest
Artificial neural network
Autoencoder
Cognitive computing
Deep learning
DeepDream
Feedforward neural network
Recurrent neural network
LSTM
GRU
ESN
reservoir computing
Restricted Boltzmann machine
GAN
Diffusion model
SOM
Convolutional neural network
U-Net
Transformer
Vision
Mamba
Spiking neural network
Memtransistor
Electrochemical RAM (ECRAM)
Reinforcement learning
Q-learning
SARSA
Temporal difference (TD)
Multi-agent
Self-play
Learning with humans
Active learning
Crowdsourcing
Human-in-the-loop
RLHF
Model diagnostics
Coefficient of determination
Confusion matrix
Learning curve
ROC curve
Mathematical foundations
Kernel machines
Bias–variance tradeoff
Computational learning theory
Empirical risk minimization
Occam learning
PAC learning
Statistical learning
VC theory
Machine-learning venues
ECML PKDD
NeurIPS
ICML
ICLR
IJCAI
ML
JMLR
Related articles
Glossary of artificial intelligence
List of datasets for machine-learning research
List of datasets in computer vision and image processing
Outline of machine learning
v
t
e
In the field of artificial intelligence (AI), the Waluigi effect is a phenomenon of large language models (LLMs) in which the chatbot or model "goes rogue" and may produce results opposite the designed intent, including potentially threatening or hostile output, either unexpectedly or through intentional prompt engineering. The effect reflects a principle that after training an LLM to satisfy a desired property (friendliness, honesty), it becomes easier to elicit a response that exhibits the opposite property (aggression, deception). The effect has important implications for efforts to implement features such as ethical frameworks, as such steps may inadvertently facilitate antithetical model behavior.[1]
The effect is named after the fictional character Waluigi from the Mario franchise, the arch-rival of Luigi who is known for causing mischief and problems.[2]
^Bereska, Leonard; Gavves, Efstratios (3 October 2023). "Taming Simulators: Challenges, Pathways and Vision for the Alignment of Large Language Models". Proceedings of the Inaugural 2023 Summer Symposium Series 2023. Vol. 1. Association for the Advancement of Artificial Intelligence. pp. 68–72. doi:10.1609/aaaiss.v1i1.27478.
^Qureshi, Nabeel S. (May 25, 2023). "Waluigi, Carl Jung, and the Case for Moral AI". Wired.
In the field of artificial intelligence (AI), the Waluigieffect is a phenomenon of large language models (LLMs) in which the chatbot or model "goes rogue"...
Waluigi is a character in the Mario franchise. He plays the role of Luigi's arch-rival and accompanies Wario in spin-offs from the main Mario series,...
Kong Jr. Super Smash Bros. Nintendo Land NES Remix Skylanders: SuperChargers Other The Wizard Mario Marathon Team 0% Super Nintendo World Waluigieffect...
alignment). An emergent challenge for implementing alignment is known as the Waluigieffect, the principle that after training an LLM to satisfy a desired property...
The Last of Us series) and Charles Martinet (Mario, Luigi, Wario, and Waluigi in Nintendo's Mario franchise).[citation needed] Other actors more linked...
Charles Martinet, American voice actor (voice of Mario, Luigi, Wario and Waluigi in the Mario franchise, Senator Wilson Philips and Speedwagon Foundation...