JieSun LOGBOOK

Posted 2023-12-24 | Updated 2024-03-31 | Tech / Grokking-interviews / ML / System-Design / ML | 4 minutes read (About 619 words)

Problem Statement

Design a Twitter feed system for 500 million daily active users that shows each user the most relevant tweets based on their social graph.

Posted 2023-12-19 | Updated 2024-03-31 | Tech / Grokking-interviews / ML / System-Design / ML | 6 minutes read (About 861 words)

Search Ranking

Problem Statement

Ask clarifying questions about scope, scale, and personalization.

Scope: general search or specialized search?
Scale: number of websites? QPS (queries per second)?
Personalization: is the user logged in or not?

Posted 2023-12-18 | Updated 2024-03-31 | Tech / Intro | 5 minutes read (About 683 words)

Intro to Grad-CAM - CNN Visualization

Grad-CAM (Gradient-weighted Class Activation Mapping) is a generalization of CAM that applies to a significantly broader range of CNN model families.
The intuition is that the last convolutional layers offer the best compromise between high-level semantics and detailed spatial information, which is lost in fully-connected layers. The neurons in these layers look for semantic, class-specific information in the image.

$$L_{\text{Grad-CAM}}^c = \mathrm{ReLU}\left(\sum_k \alpha_k^c A^k\right)$$

where $$\alpha_k^c = \frac{1}{Z}\sum_i\sum_j\frac{\partial y^c}{\partial A_{ij}^k}$$
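
The two formulas above can be sketched in a few lines of NumPy. This is a minimal illustration, not a full pipeline: it assumes the feature maps $A^k$ and the gradients of the class score with respect to them have already been extracted from the network as arrays of shape `(K, H, W)`. The function name `grad_cam` and the toy values are made up for the example.

```python
import numpy as np

def grad_cam(activations, gradients):
    """Compute a Grad-CAM heatmap from arrays of shape (K, H, W).

    activations: feature maps A^k of the chosen convolutional layer.
    gradients:   d(class score) / d(A^k), same shape as activations.
    """
    # alpha_k: average the gradients over all spatial positions (the 1/Z double sum).
    alphas = gradients.mean(axis=(1, 2))                   # shape (K,)
    # Weighted sum of the feature maps over k, followed by ReLU.
    weighted = np.tensordot(alphas, activations, axes=1)   # shape (H, W)
    return np.maximum(weighted, 0.0)

# Toy input: two 2x2 feature maps with constant gradients.
A = np.array([[[1.0, 2.0], [3.0, 4.0]],
              [[1.0, 1.0], [1.0, 1.0]]])
dY = np.array([[[1.0, 1.0], [1.0, 1.0]],
               [[-2.0, -2.0], [-2.0, -2.0]]])
heatmap = grad_cam(A, dY)
```

In practice the heatmap is then upsampled to the input image size and overlaid on the image; that step is left out here.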

Posted 2023-12-12 | Updated 2024-03-31 | Tech | 4 minutes read (About 654 words)

Image preprocessing in Xception model

The image preprocessing process for the Xception model typically includes the following steps:

Size Adjustment:
- The Xception model usually expects input images of 299x299 pixels.
Color Channel Processing:
- The Xception model expects the input to be a color image, i.e., having 3 color channels (Red, Green, Blue). If your image is grayscale (single-channel), you need to convert it into a three-channel format.
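
The two steps above can be sketched with NumPy alone. This is an illustrative sketch under assumptions: `prepare_image` is a hypothetical helper, and the nearest-neighbor resize is used only to keep the example self-contained (a real pipeline would normally resize with an image library and better interpolation).

```python
import numpy as np

TARGET_SIZE = 299  # input side length the Xception model usually expects

def prepare_image(img, size=TARGET_SIZE):
    """Return a (size, size, 3) array from a grayscale or color image array."""
    img = np.asarray(img)
    # Color channel processing: repeat a single-channel image three times.
    if img.ndim == 2:
        img = np.stack([img, img, img], axis=-1)
    # Size adjustment: nearest-neighbor resize via index lookup (an assumption
    # made for brevity, not the interpolation a production pipeline would use).
    h, w = img.shape[:2]
    rows = np.arange(size) * h // size
    cols = np.arange(size) * w // size
    return img[rows[:, None], cols]

gray = np.zeros((100, 150), dtype=np.uint8)       # hypothetical grayscale image
color = np.zeros((400, 300, 3), dtype=np.uint8)   # hypothetical RGB image
gray_ready = prepare_image(gray)
color_ready = prepare_image(color)
```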

Posted 2023-12-01 | Updated 2024-03-31 | Tech / Intro | 4 minutes read (About 591 words)

Intro to GAN

Supervised or unsupervised?

Unsupervised task: generative modeling is an unsupervised task where the model is not told what kind of patterns to look for in the data and there is no error metric to improve the model.
Supervised classifier/loss func: the training process of the GAN is posed as a supervised learning problem with the help of a discriminator.
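
To make the "supervised" part concrete, here is a small sketch of how the discriminator turns GAN training into a binary classification problem: real samples get label 1 and generated samples get label 0. The discriminator outputs below are made-up numbers, and the generator loss shown is the common non-saturating variant, which is an assumption rather than something stated in this post.

```python
import math

def binary_cross_entropy(predictions, labels, eps=1e-12):
    """Mean binary cross-entropy between predicted probabilities and 0/1 labels."""
    total = 0.0
    for p, y in zip(predictions, labels):
        total += y * math.log(p + eps) + (1 - y) * math.log(1 - p + eps)
    return -total / len(labels)

# Hypothetical discriminator outputs (probability that a sample is real).
d_on_real = [0.9, 0.8]   # on samples from the dataset
d_on_fake = [0.2, 0.1]   # on samples from the generator

# Discriminator: ordinary supervised classification, real -> 1, fake -> 0.
d_loss = binary_cross_entropy(d_on_real + d_on_fake, [1, 1, 0, 0])

# Generator (non-saturating variant): wants its fakes to be classified as real.
g_loss = binary_cross_entropy(d_on_fake, [1, 1])
```

With these numbers the discriminator is doing well, so its loss is small while the generator loss is large.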

Posted 2023-11-28 | Updated 2024-03-31 | Tech / Intro | 3 minutes read (About 483 words)

Intro to AE and VAE

Discriminative | Generative | Latent Models

Posted 2023-11-21 | Updated 2024-03-31 | Tech | 3 minutes read (About 385 words)

K-Fold Cross Validation

K-fold cross validation divides the training data into K parts, trains on K-1 of them, and tests on the remaining part. This is repeated K times so that each part serves as the test part exactly once, and the average of the K test errors is taken as the estimate of the generalization error. This makes better use of the training data than a single fixed split.
However, I ran into some problems when trying two kinds of k-fold cross validation methods. First, we need to understand the purpose of each data split.

The training set is used to train the model and obtain its parameters.
The validation set is used for tuning the model’s hyperparameters.
The test set is used to evaluate the model’s performance.
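
The basic K-fold procedure described above can be sketched in plain Python. This is a minimal sketch under assumptions: the data is split in order without shuffling, and the "model" is a stand-in that just predicts the mean of the training targets. The names `k_fold_indices` and `mean_model_error` are hypothetical.

```python
def k_fold_indices(n_samples, k):
    """Yield (train_idx, test_idx) pairs; each sample is in the test part once."""
    indices = list(range(n_samples))
    # Spread the remainder over the first folds so sizes differ by at most 1.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        test_idx = indices[start:start + size]
        train_idx = indices[:start] + indices[start + size:]
        yield train_idx, test_idx
        start += size

def mean_model_error(y, train_idx, test_idx):
    """Stand-in model: predict the training mean, report test mean squared error."""
    prediction = sum(y[i] for i in train_idx) / len(train_idx)
    return sum((y[i] - prediction) ** 2 for i in test_idx) / len(test_idx)

y = [2.0, 4.0, 6.0, 8.0, 10.0, 12.0]   # toy targets
errors = [mean_model_error(y, train, test) for train, test in k_fold_indices(len(y), 3)]
cv_error = sum(errors) / len(errors)   # average of the K test errors
```

For hyperparameter tuning, the same loop is typically run on the training portion only, with a separate test set held out for the final evaluation.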

Posted 2023-11-14 | Updated 2025-04-18 | Tech | 7 minutes read (About 975 words)

Layer Norm | Batch Norm | Instance Norm | Group Norm

LN | BN | IN | GN in NLP

Batch Norm: Normalizes each feature across the entire batch. Rarely used in NLP, because it relies on consistent sequence lengths and large batch sizes, and is sensitive to padding.
Layer Norm: Normalizes across the feature dimension for each token independently, making it suitable for variable-length sequences.
Instance Norm: Originally used in computer vision to normalize each channel within each sample. It is not commonly applied in NLP tasks.
Group Norm: Splits the feature (embedding) dimension into groups and performs normalization within each group. It’s occasionally used in NLP when LayerNorm is replaced for better generalization under small-batch or resource-constrained settings.
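
The difference between the four variants comes down to which axes the mean and variance are computed over. The NumPy sketch below illustrates this for a tensor of shape `(batch, tokens, features)`. It is a simplified sketch: the learnable scale and shift parameters are omitted, padding is ignored, and the helper names are made up for the example.

```python
import numpy as np

def normalize(x, axes, eps=1e-5):
    """Subtract the mean and divide by the standard deviation over the given axes."""
    mean = x.mean(axis=axes, keepdims=True)
    var = x.var(axis=axes, keepdims=True)
    return (x - mean) / np.sqrt(var + eps)

def group_norm(x, num_groups, eps=1e-5):
    """Split the feature dimension into groups and normalize within each group."""
    b, t, d = x.shape
    grouped = x.reshape(b, t, num_groups, d // num_groups)
    return normalize(grouped, axes=(-1,), eps=eps).reshape(b, t, d)

rng = np.random.default_rng(0)
x = rng.normal(size=(4, 6, 8))   # (batch, tokens, features)

layer_normed = normalize(x, axes=(-1,))     # per token, over its features
batch_normed = normalize(x, axes=(0, 1))    # per feature, over batch and tokens
instance_normed = normalize(x, axes=(1,))   # per sample and feature, over tokens
group_normed = group_norm(x, num_groups=2)  # per token, within each feature group
```

Only the layer norm and group norm statistics depend on a single token, which is why they are unaffected by batch size or sequence length.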

Posted 2023-11-08 | Updated 2024-03-31 | Tech | 4 minutes read (About 573 words)

Depthwise Separable Convolutions

Inception Modules

In a convolutional layer, a single convolution kernel is tasked with simultaneously mapping cross-channel correlations and spatial correlations. The idea behind the Inception module is to make this process easier and cheaper by handling the two kinds of correlation separately. A depthwise separable convolution takes this to the extreme by decoupling the operation into a depthwise convolution (i.e. a spatial convolution over each channel) and a pointwise convolution (i.e. a 1x1 kernel for cross-channel operations).
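
To see where the savings come from, the sketch below counts the weights of a standard convolution against a depthwise convolution followed by a pointwise one. The layer sizes are arbitrary example values and biases are ignored.

```python
def standard_conv_params(k, c_in, c_out):
    """Weights in a standard k x k convolution: every output channel sees every input channel."""
    return k * k * c_in * c_out

def separable_conv_params(k, c_in, c_out):
    """Weights in a depthwise (k x k per channel) plus pointwise (1x1) convolution."""
    depthwise = k * k * c_in    # one k x k spatial filter per input channel
    pointwise = c_in * c_out    # 1x1 kernel mixing channels
    return depthwise + pointwise

standard = standard_conv_params(3, 64, 128)     # 73728
separable = separable_conv_params(3, 64, 128)   # 8768
ratio = standard / separable                    # roughly 8.4x fewer weights
```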

Posted 2023-10-24 | Updated 2024-03-31 | Tech | 4 minutes read (About 611 words)

A multi-thread media stream in C++ based on OpenCV and OpenGL

Video Display

We did the video capture using OpenCV, and the video display using OpenGL.
