0 Posted 2025-04-28Updated 2025-04-28Machine Learning7 minutes read (About 1102 words)

Backpropagation Hand by Hand

Backpropagation is the algorithm that trains neural networks by adjusting their weights to minimize the loss. It works by applying the chain rule of calculus to efficiently compute how the loss changes with respect to each weight. Starting from the output layer, it propagates the error backward through the network, layer by layer, updating the weights based on their contribution to the error.$$W^\ell\!\leftarrow W^\ell - \eta\, (a^ {\ell-1}) ^\top \delta^\ell,\quad b^\ell\!\leftarrow b^\ell - \eta\,\sum\delta^\ell.$$

GNN: Graph Neural Networks

© ChandlerBang

0 Posted 2025-04-23Updated 2025-07-17Machine Learning8 minutes read (About 1130 words)

GNN: Graph Neural Networks

Graph Neural Networks (GNNs) are a class of neural networks designed to work directly with graph-structured data. They have gained significant attention in recent years due to their ability to model complex relationships and interactions in various domains, including social networks, molecular biology, and recommendation systems.

0 Posted 2025-01-01Updated 2025-01-06Notes / Class / UIUC / AI12 minutes read (About 1859 words)

High Dimension Data

High Dimension Data

0 Posted 2024-12-30Updated 2024-12-30Notes / Class / UIUC / AI7 minutes read (About 1090 words)

AI: Logistic Regression

Logistic regression is a supervised machine learning algorithm used for binary classification tasks. Unlike linear regression, which predicts continuous values, logistic regression predicts the probability that a given input belongs to a certain class.

0 Posted 2024-12-30Updated 2024-12-30Notes / Class / UIUC / AI19 minutes read (About 2802 words)

Linear Model Optimization

Linear Model Optimization

0 Posted 2024-12-30Updated 2024-12-30Notes / Class / UIUC / AI12 minutes read (About 1796 words)

Regularization

Regularization is a way to make sure our model doesn't become too complicated. It ensures the model doesn’t overfit the training data while still making good predictions on new data. Think of it as adding a 'rule' or 'constraint' that prevents the model from relying too much on any specific feature or predictor.

0 Posted 2024-10-09Updated 2024-10-09Notes / Class / UIUC / AI7 minutes read (About 1005 words)

Softmax

Softmax is a mathematical function commonly used in machine learning, particularly in the context of classification problems. It transforms a vector of raw scores, often called logits, from a model into a vector of probabilities that sum to one. The probabilities generated by the softmax function represent the likelihood of each class being the correct classification. $$\sigma(\mathbf{z})_i = \frac{e^{z_i}}{\sum_{j=1}^K e^{z_j}}$$

Support Vector Machine

0 Posted 2024-09-29Updated 2024-10-09Notes / Class / UIUC / AI14 minutes read (About 2144 words)

Support Vector Machine

Support Vector Machine (SVM) is a supervised learning algorithm used for classification and regression. It finds the best hyperplane that separates the data into different classes with the largest possible margin. SVM can work well with high-dimensional data and use different kernel functions to transform data for better separation when it is not linearly separable.$$f(x) = sign(w^T x + b)$$

Random Forest

© Inna Logunova

0 Posted 2024-09-29Updated 2024-10-09Notes / Class / UIUC / AI16 minutes read (About 2423 words)

Random Forest

Random Forest is an ensemble machine learning algorithm that builds multiple decision trees during training and merges their outputs to improve accuracy and reduce overfitting. It is commonly used for both classification and regression tasks. By averaging the predictions of several decision trees, Random Forest reduces the variance and increases model robustness, making it less prone to errors from noisy data. $$\text{Entropy}_{\text{after}} = \frac{|S_l|}{|S|}\text{Entropy}(S_l) + \frac{|S_r|}{|S|}\text{Entropy}(S_r)$$

Understanding the Taylor Series and Its Applications in Machine Learning

© Karobben

0 Posted 2024-08-09Updated 2024-10-09Machine Learning / Math9 minutes read (About 1323 words)

Understanding the Taylor Series and Its Applications in Machine Learning

The Taylor Series is a mathematical tool that approximates complex functions with polynomials, playing a crucial role in machine learning optimization. It enhances gradient descent by incorporating second-order information, leading to faster and more stable convergence. Additionally, it aids in linearizing non-linear models and informs regularization techniques. This post explores the significance of the Taylor Series in improving model training efficiency and understanding model behavior. $$\cos(x) = \sum_{n=0}^{\infty} \frac{(-1)^n}{(2n)!} x^{2n}$$

Multi-layer Neural Nets

© Karobben

0 Posted 2024-02-18Updated 2025-04-28Notes / Class / UIUC / AI21 minutes read (About 3093 words)

Multi-layer Neural Nets

Multi-layer Neural Nets

Hidden Markov Model

© Karobben

0 Posted 2024-02-12Updated 2024-02-16Notes / Class / UIUC / AI8 minutes read (About 1152 words)

Hidden Markov Model

Hidden Markov Model

Perceptron

© Dr. Roi Yehoshua

0 Posted 2024-02-07Updated 2024-10-09Notes / Class / UIUC / AI8 minutes read (About 1175 words)

Perceptron

Perceptron

Linear Regression

© geeksforgeeks

0 Posted 2024-02-05Updated 2024-12-30Notes / Class / UIUC / AI32 minutes read (About 4772 words)

Linear Regression

Linear Regression

Learning Progress

© Shanthababu Pandian

0 Posted 2024-02-02Updated 2024-02-13Notes / Class / UIUC / AI6 minutes read (About 885 words)

Learning Progress

Learning Progress