Enfow's Blog
🦄 About
📚 Posts
Computer-Science
[OS]
from Batch processing to Time sharing
[OS]
Kernel and System call
[OS]
Monolithic kernel and MicroKernel
[OS]
What is Process
[OS]
Interprocess Communication
[OS]
CPU Scheduling
[OS]
Thread
[OS]
Multi Process & Multi Thread
[IP]
Routing Protocol BGP & OSPF
[IP]
ICMP Internet Control Message Protocol
[IP]
Transport Layer Protocols, TCP & UDP
[Programming]
Asymptotic Notation
[Programming]
Functional Programming and Static Type Checking With Ocaml
[Programming]
Keywords for OOP
Python
Python Bulit-in Time Complexity: List
Python Bulit-in Time Complexity: Set & Dictionary
Python import system
Python Global Interpreter Lock
Python String Formatting
% operator, format(), f'string'
Deep-Learning
Gradient Descent
Convolution Neural Network
Loss function and Maximum likelihood
Energy Based Model
Reinforcement-Learning-Introduction
Multi-armed Bandit
Markov Property and Bellman Equation
Dynamic Programming
Monte Carlo Methods
Temporal Difference
Planning and Learning
On-Policy Approximation of Action Values
Policy Gradient Method
Bellman Operator
Statistics
Expectation, Variance and Covariance
Probability Distributions
Bayesian Rule
Decision Theory
Shannon Entropy
MLE / MAP
ROC Curve
Linear-Algebra
Quick Review: Linear Algebra
Linear Algebra Introduction
Linear Combination
Linear Mapping
Inner Product and Norm
Least Square Problem
Orthogonality
EigenDecomposition
Singular Value Decomposition
Outper Product and Determinant
Optimization
[Optimization]
Optimization Problem
[ConvexOptimization]
Convex Optimization Introduction
Mathematics-for-ML
Taylor Series
Differentiation of Multivariate Function
Experiments
[DL]
Universal Approximation Experiment
Neural-Network Papers
[Universal Approximation]
Approximation by Superposition of a Sigmoidal Function
[Optimizer]
An overview of gradient descent optimization algorithms
[Activation]
Smooth Adversarial Training
[Batch Normalization]
Accelerating Deep Network Training by Reducing Internal Covariate Shift
Generative-Model Papers
[VAE]
Auto-Encoding Variational Bayes
[GAN]
Generative Adversarial Nets
[AAE]
Adversarial AutoEncoders
Model-Free-RL Papers
[DQN]
Playing Atari with Deep Reinforcement Learning
[DDQN]
Deep Reinforcement Learning with Double Q-learning
[Dueling DQN]
Dueling Network Architectures for Deep Reinforcement Learning
[DPG]
Deterministic Policy Gradient Algorithms
[DDPG]
Continuous Control with Deep Reinforcement Learning
[TD3]
Addressing Function Approximation Error in Actor-Critic Methods
[SAC]
Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
[OAC]
Better Exploration with Optimistic Actor-Critic
[TRPO]
Trust Region Policy Optimization
[PER]
Prioritized Experience Replay
[HER]
Hindsight Experience Replay
Model-Based-RL Papers
[DeepDyna Q]
Integrating Planning for Task-Completion Dialogue Policy Learning
[WorldModel]
World Models
[PlaNet]
Learning Latent Dynamics for Planning from Pixels
[Dreamer]
Dream to Control: Learning Behaviors by Latent Imagination
Parallel-RL Papers
[A3C]
Asynchronous Methods for Deep Reinforcement Learning
[IMPALA]
Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
Multi-Agent-RL Papers
[COMA]
Counterfactual Multi-Agent Policy Gradients
Batch-RL Papers
[BCQ]
Off-Policy Deep Reinforcement Learning without Exploration
[REM]
Striving for Simplicity in Off-Policy Deep Reinforcement Learning
Distributional-RL Papers
[C51]
A Distributional Perspective on Reinforcement Learning
Meta-Learning Papers
[Matching Network]
Matching Networks for One Shot Learning
[MAML]
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Sequence-to-Sequence Papers
[RNN Encoder-Decoder]
Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation
[Attention]
Neural Machine Translation by Jointly Learning to Align and Translate
Large-Language-Model Papers
OpenAI o1
[GPT-1]
Improving Language Understanding by Generative Pre-Training
[BERT]
Pre-training of Deep Bidirectional Transformers for Language Understanding
[GPT-2]
Language Models are Unsupervised Multitask Learners
[GPT-3]
Language Models are Few-Shot Learners
[Instruct GPT]
Training language models to follow instructions with human feedback
[Chain-of-Thought]
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
[Dialogue-generation]
Large Language Models meet harry potter
Transfer-Learning Papers
[Transfer Learning]
Learning and Transferring Mid-Level Image Representations using Convolutional Neural Networks
Graph-Neural-Network Papers
Relational Inductive Biases, Deep Learning, and Graph Networks
[GCN]
Semi-Supervised Classification with Graph Convolutional Networks
[LGCN]
Large-Scale Learnable Graph Convolutional Networks
Combinatorial-Optimization Papers
[Placement]
Device Placement Optimization with Reinforcement Learning
[Placement]
Chip Placement with Deep Reinforcement Learning
[NP-hard graph]
Learning Combinatorial Optimization Algorithms over Graphs
Anomaly-Detection Papers
Variational AutoEncoder based Anomaly Detection using Reconstruction Probability
Segmentation Papers
[FCN]
Fully Convolutional Networks for Semantic Segmentation
[3D Segmentation]
Deep Neural Networks for Anatomical Brain Segmentation
Semiconductor
[FPGA]
Field Programmable Gate Array
[Placement]
VLSI Cell Placement Technique
Robotics
[Industrial Robot]
Balancing a Robotic Spot Welding Manufacturing Line: an Industrial Case Study
Machine-Learning-project
[Python]
Pandas, Pytorch
[RL]
Install Mujoco Environment
[RL]
Mujoco Env: Reacher & Hopper
[RL]
Gym Documentation
[MlFlow]
mlflow Introduction
Environment-Settings
[Python]
pyenv, virtualenv, autoenv
[Vim]
Customizing Vim
[Git]
Github SSH Setting
[Git]
Git Pre-Commit Settings
[Tools]
Linting & Formatting
[Linux]
Linux Remote Server Settings
[Docker]
Docker Introduction
Embedded
[RaspberryPi]
Raspberry Pi initial Setup
Install Ubuntu & Connect WiFi
[RaspberryPi]
Raspberry Pi GPIO Control
Survo Moter Control With Raspberry Pi