cd /home/$USER/workspace/
git clone [email protected]:behnamasadi/robotic_notes.git
vcpkg is configured as a git submodule. Initialize it:
cd /home/$USER/workspace/robotic_notes
git submodule update --init --recursive
set the path:
export VCPKG_ROOT=$PWD/vcpkg
export PATH=$VCPKG_ROOT:$PATH
Setting VCPKG_ROOT tells vcpkg where your vcpkg instance is located.
Install required system dependencies for vcpkg (on Linux):
sudo apt-get install -y bison flex build-essential cmake autoconf autoconf-archive automake libtool libltdl-dev libx11-dev libxft-dev libxext-dev libxtst-dev libxrandr-dev ninja-build pkg-config
Now you can run:
cmake -S . -B build \
-DCMAKE_TOOLCHAIN_FILE=./vcpkg/scripts/buildsystems/vcpkg.cmake \
-DCMAKE_BUILD_TYPE=Release \
-DVCPKG_TARGET_TRIPLET=x64-linux-release
The VCPKG_TARGET_TRIPLET=x64-linux-release option ensures vcpkg only builds release packages, which significantly reduces build time (especially for large packages like OpenCV) and disk space usage. This is already configured in CMakeLists.txt, but you can explicitly set it as shown above.
The Rerun C++ SDK (and its Arrow dependency) is disabled by default because the SDK is downloaded and built via FetchContent, which is heavy. The Python rerun-sdk package covers the same use cases for the notebooks. To opt in and build the rerun-dependent C++ targets, add the following flags:
cmake -S . -B build \
-DCMAKE_TOOLCHAIN_FILE=./vcpkg/scripts/buildsystems/vcpkg.cmake \
-DCMAKE_BUILD_TYPE=Release \
-DVCPKG_TARGET_TRIPLET=x64-linux-release \
-DBUILD_RERUN_EXAMPLES=ON \
-DVCPKG_MANIFEST_FEATURES=rerun
cmake --build build --parallel
conda create -n robotic_notes python=3.12 -y
conda activate robotic_notes
cd /home/$USER/anaconda3/envs/robotic_notes/
Create this soft link.
ln -s /home/$USER/workspace/robotic_notes /home/$USER/anaconda3/envs/robotic_notes/src
Install the python packages:
conda install -c conda-forge opencv
pip install graphslam
conda install conda-forge::gtsam
conda install conda-forge::matplotlib
conda install conda-forge::plotly
conda install -c conda-forge jupyterlab
pip install ahrs
pip install pyceres
pip install liegroups
pip install rerun-sdk[notebook]==0.29.2
pip install "gradio_rerun==0.29.2"
pip install "gradio==6.5.1"
pip install ipykernel
pip install jupyterlab
- Robotic System Design & Design Patterns
- 1. High-Level Architecture of a Humanoid Robot
- 2. Perception Pipeline: Consistency, Robustness, Synchronization
- 3. High-Frequency Sensor Pipelines in C++
- 4. Concurrency & Real-Time: SPMC Sensor Distribution
- 5. Task Execution: State Machine vs Behavior Tree
- 6. Debugging a Regressing Perception Model in Production
- 7. Time Synchronization
- 8. Safety & Fault Tolerance
- 9. Logging, Observability & Replay
- Configuration of Robot
- Configuration Space - (C-space )
- Degrees of freedom.
- Task Space
- Work Space
- Dexterous space:
- dof
- Topology
- Algebraic topology (playlist)
- Non-Holonomic Constraints, Pfaffian Constraints and Holonomic Constraints
- Kinematics of Differential Drive Robots and Wheel odometry
- Velocity-based (dead reckoning)
- Nonlinear uncertainty model associated with a robot's position over time (The Banana Distribution is Gaussian)
- Coordinate Frame Conventions — ENU, NED, FRD, FLU, RDF, RUB
- 1. Global References
- 2. Accelerometer Model
- 3. Gyroscope Model
- 4. Attitude from gravity (Tilt)
- Expressing IMU reading with Quaternion
- 5. Quaternion from Accelerometer
- 6. Quaternion Integration
- 7.1 Quaternion Derivative
- Relationship Between Euler-Angle Rates and Body-Axis Rates
- Complementary Filter
- Quaternion-Based Complementary Filter
- Accelerometer-Based Correction
- Attitude from angular rate (Attitude propagation)
- IMU Integration
- Noise Spectral Density
- Signal-to-noise Ratio
- Allan Variance curve
- Autoregressive model
- Madgwick Orientation Filter
- Mahony Orientation Filter
- Simulating IMU Measurements
- IMU Propagation Derivations
- IMU Noise Model
- The standard deviation of the discrete-time noise process
- Square binary fiducials (ArUco, AprilTag
- Composite calibration boards (ChArUco, AprilGrid)
- Which marker should I use?
- AprilTag — coordinate frame and measurements
- Generating AprilTags
- Detecting AprilTags (C++, Python, ROS)
- Common pitfalls
- Choosing a calibration target
- Calibration target files (AprilGrid PDF, YAMLs)
- Supported camera models and distortion
- Building the Kalibr Docker image
- Recording the calibration bag
- Camera-only calibration
- Camera–IMU calibration
- Common pitfalls
- 1. Names
- 2. Remapping Arguments
- NodeHandles
- Roslaunch
- URDF
- Publishing the State
- ROS best practices
- move_base
- ROS Odometery Model
- ROS State Estimation
- EKF Implementations
- Differential Drive Wheel Systems
- Choosing a distro
- Pull the image
- Run a persistent container
- Extending the image with a Dockerfile
- Docker Compose for multi-container setups
- ROS 2 ↔ ROS 1 bridge
- ROS 2 inside the container
- Colcon
- Packages
- Xacro
- Launch files
- Nav2
- teleop_twist_keyboard
- Useful container shortcuts
- Gazebo Versions
- Installation
- Building a model
- Building world
- Moving the robot
- Sensors
- Spawn URDF
- ROS 2 integration
- Distro / Gazebo mapping
- Dockerfile (Jazzy + Harmonic + ros_gz)
- Running the container
- GPU access (NVIDIA)
- Launching Gazebo
ros_gz_bridgeessentials- Demo recipes
- Troubleshooting
- Probability Review (Marginals/conditionals, probability vs likelihood, conditional Gaussians ) (foundational primer)
- Bayes Filter
- Maximum A-Posteriori (MAP) Estimation
- Kalman Filter
- Extended Kalman Filter
- Extended Kalman Filter for Differential Drive Robot
- Error State Extended Kalman Filter
- Error State Extended Kalman Filter(IMU, a GNSS, and a LiDAR)
- Multi-State Constraint Kalman Filter (MSCKF)
- Quaternion kinematics for the error-state Kalman filter
- Active Exposure Control for Robust Visual Odometry in HDR Environments
- Pose Graph SLAM from Scratch
- nano-pgo
- General Graph Optimization g2o explained
- Factor Graphs, Bayes Trees, and iSAM2 (with GTSAM)
- Resilient Autonomy in Perceptually-degraded Environments
- HBA Large-Scale LiDAR Mapping Module
- Hierarchical, multi-resolution volumetric mapping (wavemap)
- kiss-icp
- TagSLAM SLAM with tags
- OpenDroneMap
- Interactive SLAM
- Volumetric TSDF Fusion of Multiple Depth Maps
- Euclidean Signed Distance Field (ESDF)
- Lidar odometry smoothing using ES EKF and KissICP for Ouster sensors with IMUs
- Multisensor-aided Inertial Navigation System (MINS)
- GLOMAP explained
- Zero-Shot Point Cloud Registration
- Add Apriltag to loop closure
- Navtech Radar SLAM
- Procrustes Analysis
- Wahba's Problem
- Quaternion Estimator Algorithm (QUEST)
- Kabsch Algorithm
- Umeyama Algorithm
- Iterative Closest Point (ICP)
The vio_benchmark/ subproject is a self-contained
framework for comparing visual-inertial odometry estimators against
public datasets (EuRoC MAV, TUM-VIO, and others). It bundles OpenVINS
and VINS-Fusion as submodules, ships per-dataset calibration files
and ground-truth references, and includes pipeline scripts plus a
rerun.io visualisation that overlays multiple trajectories with the
live stereo camera projected inside each estimator's frustum. First
head-to-head result: OpenVINS 0.295 m APE vs VINS-Fusion 0.248 m on
EuRoC MH_01_easy.
- Trajectory analysis methodology and case studies
- Diagnostic procedure for broken VIO
- Estimator parameter reference
- Visual-Inertial Navigation Systems: An Introduction
- IMU Propagation Derivations (OpenVINS)
- Error State Kalman Filter VIO (ESKF-VIO)
- Head-to-head — OpenVINS vs VINS-Fusion on EuRoC MH_01_easy
- Dataset recipes — EuRoC, TUM-VIO, M2DGR, 4Seasons
- Debug session log
- Hilti x Trimble SLAM Challenge 2026
- Benchmark Comparison of Monocular Visual-Inertial Odometry Algorithms for Flying Robots
- A Comparison of Modern General-Purpose Visual SLAM Approaches
- ETH3D
- rvp group
- M3DGR — Multi-sensor, Multi-scenario, Massive-baseline SLAM Dataset for Ground Robots
The lio_benchmark/ subproject is a self-contained
Gazebo simulator for LiDAR-inertial odometry development. It ships a
SubT-style rover with a 16-beam LiDAR + IMU + cameras, three
cave/tunnel worlds, and FAST-LIO wired into the rover by default.
One-line bring-up via docker compose. Unlike VIO, LIO works
correctly on this sim (FAST-LIO consistently produces 0.3–1.5 %
end-point error on rover recordings) — the architectural reason is in
the VIO diagnostic guide §3.
- lio_benchmark README and quick start
- Simulator GPU/rendering performance notes
- FAST-LIO + LiDAR tuning reference
- Why LIO is robust to bad IMU but VIO isn't
- Robust Real-time LiDAR-inertial Initialization
- Lidar SLAM for Automated Driving (MATLAB)
- DSEC — Stereo Event Camera Dataset for Driving Scenarios
- FAST-LIO
- iG-LIO — incremental GICP tightly-coupled LIO
- DLIO — Direct LiDAR-Inertial Odometry with Continuous-Time Motion Correction
- CT-LIO — Continuous-Time LiDAR-Inertial Odometry
- LIO-SAM
- GLIM — versatile and extensible range-based 3D mapping framework
- LIMO — Lidar-Monocular Visual Odometry
- Structure from Motion from Scratch
- Robust Rotation Averaging
- Bundler
- Noah Snavely Reprojection Error
- Global Structure-from-Motion Revisited
- LightGlue
- XRefine: Attention-Guided Keypoint Match Refinement
- DenseSFM
- Pixel-Perfect Structure-from-Motion
- image-matching-webui
- Gaussian Splatting
- GANeRF
- DSAC*
- Tracking Any Point (TAP)
- image-matching-benchmark
- Local Feature Matching at Light Speed
- Hierarchical Localization
- instant-ngp
- NeRF-SLAM
- DROID-SLAM
- ACE0
- A Hierarchical 3D Gaussian Representation for Real-Time Rendering of Very Large Datasets -
- DoubleTake:Geometry Guided Depth Estimation
- Mitigating Motion Blur in Neural Radiance Fields with Events and Frames
- LEAP-VO: Long-term Effective Any Point Tracking for Visual Odometry
- MegaScenes: Scene-Level View Synthesis at Scale
- Intrinsic Image Diffusion for Indoor Single-view Material Estimation
- Vidu4D: Single Generated Video to High-Fidelity 4D Reconstruction with Dynamic Gaussian Surfels
- Detector-Free Structure from Motion
- Continuous 3D Perception Model with Persistent State
- MegaSam: Accurate, Fast and Robust Casual Structure and Motion from Casual Dynamic Videos
- Mast3r Slam with Rerun
- Stereo Any Video:Temporally Consistent Stereo Matching
- Multi-view Reconstruction via SfM-guided Monocular Depth Estimation
- UniK3D: Universal Camera Monocular 3D Estimation
- Depth Any Camera: Zero-Shot Metric Depth Estimation from Any Camera
- Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
- VGGT: Visual Geometry Grounded Transformer
- MAGiC-SLAM: Multi-Agent Gaussian Globally Consistent SLAM
- Speedy MASt3R
- CURL-MAP
- Depth-Anything-3
- SLAM3R Real-Time Dense Scene Reconstruction from Monocular RGB Videos
- Human3R: Everyone Everywhere All at Once
- Describe Anything, Anywhere, at Any Moment (DAAAM)
- Generative View Stitching
- Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation (ICLR 2026)
- VGGSfM
- InstantSfM
- MatchAnything
- DepthNet, PoseNet
- ViT for Monocular Visual odometry
- VO Loss Functions
- VO Evaluation Metrics
- Photometric Reprojection Loss
- Edge-Aware Depth Smoothness
- SSIM (Structural Similarity)
- Understanding Poses
- Modern Robotics Mechanics, Planning, and Control (Kevin M. Lynch, Frank C. Park)
- Modern Robotics Mechanics, Planning, and Control (Instructor Solution Manual, Solutions )
- MODERN ROBOTICS MECHANICS, PLANNING, AND CONTROL (Practice Exercises)
- Basic Knowledge on Visual SLAM: From Theory to Practice, by Xiang Gao, Tao Zhang, Qinrui Yan and Yi Liu
- STATE ESTIMATION FOR ROBOTICS (Timothy D. Barfoot)
- SLAM for Dummies
- VSLAM Handbook
- SLAM Handbook
- Matrix Calculus (for Machine Learning and Beyond)
- Reinforcement Learning: A Comprehensive Overview
- Linear Algebra for Computer Vision, Robotics, and Machine Learning