Bogdan's Docs

Table of Contents

Table of Contents

  • Artificial Intelligence
    • February 25th, 2025
    • February 18th, 2025
    • March 11th, 2024
    • January 15th, 2024
    • January 8th, 2024
    • December 11th, 2023
    • December 4th, 2023
    • September 18th, 2023
    • September 11th, 2023
    • September 4th, 2023
    • August 21th, 2023
    • June 27th, 2023
    • June 19th, 2023
    • June 12th, 2023
    • May 29th, 2023
    • May 15th, 2023
    • May 8th, 2023
    • May 1st, 2023
    • April 24th, 2023
    • April 17th, 2023
    • April 10th, 2023
    • April 3rd, 2023
    • March 27th, 2023
    • March 20th, 2023
    • March 13th, 2023
    • March 6th, 2023
    • Feb 19th, 2023

Ideas

  • Artificial Intelligence
    • Bond Pricing
    • Kubernetes Inference
    • Knowledge Bot
    • Ready to Use Models
    • Satellite Damage Detection

Curs IoT si AI (ro) Resources

  • Links
    • Artificial Intelligence
  • Recipes
    • MacOS
Bogdan's Docs
Docs » resources:links:artificial_intelligence

Artificial Intelligence

February 25th, 2025

Programming (General):

  • Concurrency in Go, Pony, Erlang (Elixir) and Rust

Programming (Python):

  • 9 MindBlowing Python Features You Aren’t Using Enough
  • Stop making your python projects like it was 15 years ago...
  • Use Design Patterns in Python: Are You Still Writing Procedure-Oriented Code?

LLMs & AI:

  • Goodbye, Text2SQL: Why Table-Augmented Generation (TAG) is the Future of AI-Driven Data Queries!
  • What’s the Best PDF Extractor for RAG? I Tried LlamaParse, Unstructured and Vectorize
  • Fabric: The Best AI Tool That Nobody is Talking About
  • 15 AI Agent Business Ideas in 2025
  • 6 AI Agents That Are So Good, They Feel Illegal
  • Prompt Decorators: A Simple Way to Improve AI Responses
  • Data Science All Algorithm Cheatsheet 2025
  • Every AI Engineer Should Know these 15 Python Libraries in 2025

Management:

  • The Slow, Painful Death Of Agile and Jira

February 18th, 2025

  • KAG: A Better Alternative to RAG for Domain-Specific Knowledge Applications
  • Debugging in Python: Replace print() with ic() and Do It Like a Pro
  • ChatGPT for Data Analysis: A Beginner’s Guide
  • Why Building Your AI Agent Could Be Your Most Valuable Investment in 2025
  • Building Successful AI Apps: The Dos and Don’ts
  • Why Simple Code Is Harder Than You Think
  • 10 Awesome Developer Tools to Make Your Life Easier
  • Zonos: Text to speech

Bonus:

  • Hybrid Search a method to Optimize RAG implementation
  • Istio Workshop
  • Grounding LLM’s — Part 1

March 11th, 2024

  • Wisdom of Unstructured Data: Building Airbnb’s Listing Knowledge from Big Text Data
  • A Definitive Guide to Using BigQuery Efficiently
  • 7 Query Strategies for Navigating Knowledge Graphs With LlamaIndex
  • Constructing knowledge graphs from text using OpenAI functions
  • A first intro to Complex RAG (Retrieval Augmented Generation)
  • Advance RAG - Improve RAG performance

January 15th, 2024

  • Creating a Programming Language and Compiler with Python and LLVM
  • From ElasticSearch to Apache Doris: Upgrading an Observability Platform
  • System Design Cheatsheets: ElasticSearch
  • Cotainerization of Machine Learning Applications

January 8th, 2024

  • How we built a Real Time Order Tracking System using Event Driven Architecture.
  • Forget RAG: Embrace agent design for a more intelligent grounded ChatGPT!
  • How to Setup AWS Locally Using LocalStack Without Spending a Buck
  • Building a Real-Time Data Streaming Pipeline using Apache Kafka, Flink and Postgres
  • Manage Airflow connections with Terraform and AWS SecretsManager

December 11th, 2023

  • LMQL — SQL for Language Models
  • MLOps for recommenders - Deploying Recommender System in Production
  • How Kubernetes requests are implemented
  • Controlling Kubernetes Traffic with Topology Aware Routing
  • Using encrypted container images in a confidential VM

December 4th, 2023

  • What the heck is Malloy? - feels like JSON made a baby with SQL, in the worse possible way
  • What the heck is Apache SeaTunnel?
  • Spark Concepts Simplified: Cache, Persist, and Checkpoint
  • Wait! What are Pipelines in Python?
  • The ins and outs of Airflow’s new Secrets Cache
  • Deploying Apache Spark on a Local Kubernetes Cluster: A Comprehensive Guide
  • Kubeflow Pipelines: A step by step guide
  • Meet Michelangelo: Uber’s Machine Learning Platform
  • Ray is an open-source unified compute framework that makes it easy to scale AI and Python workloads

September 18th, 2023

  • How to serve machine learning model using FastAPI, MLFlow, MINIO and MySQL

September 11th, 2023

  • Concurrency with FastAPI
  • Introduction to Quant investing with python

September 4th, 2023

  • A data engineering project with Prefect, Docker, Terraform, Google CloudRun, BigQuery and Streamlit
  • Personalized Recommendations with Transformers
  • How does Facebook handle billions of async requests?
  • FastAPI Advanced Request Processing

August 21th, 2023

  • Causal Machine Learning for Creative Insights
  • How to build a 5 layer data stack
  • Meta's FairScale Documentation
  • Nginx vs Traefik v2 vs Traefik v3
  • PySpark style guide

June 27th, 2023

  • Using contextual bandit models in large action spaces at Instacart

June 19th, 2023

  • Think twice before using asyncio in Python
  • Data engineering is not software engineering
  • The Netflix Cosmos platform
  • 5 Must know distributed systems design patterns for event driven
  • Event Driven shopping app with Python + Kafka + BigQuery
  • Open sourcing a 10x reduction in Apache Cassandra tail latency
  • Deep multi-task learning and real time personalization for closeup recommendations

June 12th, 2023

  • How to make the most of Pydantic
  • Data Science at OLX
  • Evolution of Image Model Serving at OLX

May 29th, 2023

  • Writing design docs for data Pipelines
  • How we monitor thousands of spark data pipelines
  • How Netflix microservices tackle dataset pub-sub
  • Mage and its magical data plumbing powers
  • Top 10 books for mastering the art of building software at scale

May 15th, 2023

  • We're all using Airflow wrong and how to fix it

May 8th, 2023

  • SageMaker - Bring Your Own Container
  • Alternatives to Airflow
  • Building a Data Warehouse - Architectural Principles
  • Hystrix - Resilient APIs Connections

May 1st, 2023

  • Hazelcast as Redis Alternative
  • Estimating F1-score in the absence of ground thruth
  • How Discord Evolved from Storing Ms of Messages to Ts of Messages
  • Airbnb System Design

April 24th, 2023

  • 5 Data Architectures to Check in 2023
  • Terraform - Deploying Secure Highly Available and Fault Tolerant Cloud
  • How Razorpay Scaled up Their Notification Service
  • Spark Runtime for AWS Lambda
  • Bolt Pipeline and Data Monitoring

April 17th, 2023

  • Learn to use Kubernetes CRDs

April 10th, 2023

  • About Optimizations
  • Using files and devices in Podman rootless containers
  • Brief intro in Open Table Formats
  • Machine Learning Orchestration with Airflow (lectures material)
  • MLOps Project Part 1

April 3rd, 2023

  • HRNN Recommender
  • Pseudonymisation Techniques and Best Practices
  • DuckDB - what's the hype about
  • DuckDB Poor Man's DataLake

March 27th, 2023

  • Boto3 vs SageMaker Python API
  • DBT vs Delta Live Tables
  • Airflow Custom Operators
  • Writing a Good Airflow Dag
  • Flyte, Lift's alternative to AirFlow (Wolt ML Infra)
  • Build an ML Platform from Scratch
  • 13 SQL statements for 90% of Data Science (lectures material)
  • Build a minimal Linux with Busybox (lectures material)

March 20th, 2023

  • K3S Cluster with Nginx Ingress
  • Nginx Ingress
  • Nginx Controller via YAML

March 13th, 2023

  • Install Jupyterhub with Kubespawner

March 6th, 2023

  • Scaling AllenNLP in Production
  • Most common data architectures in AWS
  • Feature stores explained - the 3 common architectures
  • Bring your own container with Amazon SageMaker
  • Amazon SageMaker containers overview
  • Data Ingestion Pipeline with Operation Management @ Netflix
  • Configuring uWSGI for Production

Feb 19th, 2023

  • Gunicorn - important configuration options with a mind towards performance
  • Memray - Bloomberg's Python memory profiler
  • File storage format for chunked, compressed, N-dimensional arrays
  • What are AWS Sagemaker Endpoints?
  • Running 1000 models a month with Apache Airflow on AWS
  • Understanding Memory Mapping
Previous Next