О.К.Б Саблин

/About%20ADHD%20Symptoms%20and%20Hormones

healthcare

About ADHD Symptoms and Hormones

Written after I fucked up my therapy due to gastrointestinal disorder as a remainder. Don’t take it seriously.

/Brief%20Dive%20into%20Macro-o1%20and%20o1-like%20LLMs

LLM

Brief Dive into Macro-o1 and o1-like LLMs

Macro-o1 7B is the first o1-like open source LLM. In this post, we will dive into its training paradigm and its inference implementation.

/Kyle's%20Market%20Micro-architecture%20Model

game-theory

Kyle's Market Micro-architecture Model

The trading behavior in a market is often backed by asymmetric information, making the market a asymmetric game. Albert S. Kyle proposed a micro-structure formulation in 1985, stating that such market consists of three players: an insider, a random noise trader, and a market maker. His successors also proved that there exist an equilibrium with additional conditions. In this post, we will discuss the formulation of Kyle’s model formulated as an extensive form game and an Optimal-Transport based setting of equilibrium.

EBM

From MPC to Diffusion Policy

In this post, we will try to connect Energy-Based Model with classical optimal control frameworks like Model-Predictive Control from the perspective of Lagrangian optimization.

/Notes%20on%20'The%20Energy-Based%20Learning%20Model'%20by%20Yann%20LeCun,%202021

EBM

Notes on 'The Energy-Based Learning Model' by Yann LeCun, 2021

In this note we connect the energy concepts including Lagrangian Dynamics and Gibbs Formula to measuring the quality of prediction. Then we go over the inferencing and training in multi-modal scenarios.

/How%20Would%20Diffusion%20Model%20Help%20Robot%20Imitation

robotics

How Would Diffusion Model Help Robot Imitation

In short, using diffusion process instead of directly applying MSE loss on trajectories enables a wider variety of trajectory solutions learnt from imitation data, instead of the trajectory provided by dataset(s), which is beneficial for small-set imitation learning. In this blog post, we will discuss how and why diffusion process can achieve such and trajectory-agnostic result.

diffusion

Notes on ScoreGrad

ScoreGrad is a EBM make use of iterative conditional SDE sampling via diffusion to perform multi-variate probabilistic time series prediction. Basically it uses RNN/LSTM/GRU to encode past time series as a condition and sample a probability distribution of predicting time series based on this.

GPU

Swizzling: The Snake Trying Not to Bite Itself

Abstract To avoid shared memory conflict in shared memory for each thread in different banks, swizzling is invented to create offsets for memory allocation. Which means, finding places for the memory that requested by a Layout.

/%5BReading%5DPARIS:%20Part-level%20Reconstruction%20and%20Motion%20Analysis%20for%20Articulated%20Objects

NeRF

[Reading]PARIS: Part-level Reconstruction and Motion Analysis for Articulated Objects

ABSTRACT: This paper introduces a self-supervised, end-to-end architecture that learns part-level implicit shape and appearance models and optimizes motion parameters jointly without requiring any 3D supervision, motion, or semantic annotation. The training process is similar to original NeRF but and extend the ray marching and volumetric rendering procedure to compose the two fields.

/%E3%80%90Reading%E3%80%91Ditto-Building%20Digital%20Twins%20of%20Articulated%20Objects%20from%20Interaction

Interactive-Perception

【Reading】Ditto-Building Digital Twins of Articulated Objects from Interaction

This paper propose a way to form articulation model of articulated objects by encoding the features and find the correspondence of static and mobile part via visual observation before and after the interaction.