DEV Community

Jangwook Kim

404 bio not found

Personal website: https://effloow.com
Claude Design and Claude Routines: Anthropic's New Agentic Products

9 min read
RAGFlow: Self-Host a Deep-Document RAG Engine

10 min read
Claude Haiku 4.5: When to Use It Over Sonnet 4.6

9 min read
Google ADK vs LangGraph 2026: I Installed Both and Compared Them Side by Side

6 min read
Microsoft Agent 365: AI Agent Governance for Developers

10 min read
Temporal for AI Agents: Durable Execution Guide 2026

9 min read
Intel OpenVINO 2026.0: Run LLMs on NPU for Free

9 min read
POLARIS: Typed DAG Planning for Governed AI Agents

10 min read
Cloudflare Moltworker: Self-Hosted AI Agents Without Hardware

10 min read
Mercury 2: Inception's Diffusion LLM at 1,000 Tokens/s

9 min read
Langfuse v3 Self-Hosting Complete Guide — Building LLM Tracing on Your Own Infrastructure

7 min read
Microsoft Agent Governance Toolkit: OWASP Agentic AI Top 10

10 min read
Cloudflare AI Gateway: Zero-Config LLM Proxy for Production

11 min read
Gemini 3.1 Flash TTS: Production API Guide for Developers

8 min read
Why Anthropic Cut Off OpenClaw — The Claude Subscription Policy Shift and What It Costs You

8 min read
Xiaomi MiMo-V2.5-Pro: Open-Source 1T Coding Agent Guide 2026

9 min read
Devstral 2: Run Mistral's Open Coding Agent Locally

9 min read
Gemma 4 26B vs 31B: Which Model to Run Locally

10 min read
Anthropic's April Double Release — How Opus 4.7 and Managed Agents Change Agent Development

7 min read
Token Optimization for Production LLMs: Cut Costs Effectively

10 min read
Build an MCP Server with TypeScript: 2026 Tutorial

9 min read
LLM Prompt Caching in Production: Cut API Costs 78% With Claude

7 min read
Claude Streaming + Tool Use: Build Real-Time Agentic Pipelines

6 min read
OpenAI o3 Pro API: Maximum Reasoning for Hard Tasks

9 min read
How to Build a PR Auto-Review Pipeline with GitHub Actions + Claude Code CLI

8 min read
DSPy 3.x: Compile and Optimize LLM Pipelines Automatically

9 min read
smolagents + MCP Bridge: Connect Any Tool to Your Agent

10 min read
Arcee Trinity Large Thinking: Open Source 400B Reasoning Guide

9 min read
PydanticAI Practical Tutorial — Building Type-Safe AI Agents the FastAPI Way

8 min read
MiniMax M2.5 API Guide: 80% SWE-Bench at $0.15/M Tokens

10 min read
A2A Protocol PoC: Build an Agent Server in Python

8 min read
Warp 2.0: The Terminal That Became an Agentic Development Environment

12 min read
Anthropic Message Batches API Production Guide — Cut LLM Costs 50% at Scale

8 min read
vLLM 0.8: Native Llama 4 MoE Routing Explained

10 min read
Claude Opus 4.7: Effort Controls and Migration Guide

9 min read
AI Distiller: Extract LLM-Ready Code Context in Seconds

9 min read
Claude API Prompt Caching in Practice — 4 Patterns That Cut LLM Costs by 70%

7 min read
markitdown: Convert Any Document to Markdown for LLMs

8 min read
On-Device AI 2026: Developer Guide to NPUs and Edge Inference

12 min read
Cursor 3 vs Claude Code vs Windsurf — Which AI Coding Tool Should You Use in 2026?

10 min read
ChatGPT Workspace Agents: OpenAI's Enterprise Agent Platform

10 min read
Google Gemini Enterprise Agent Platform: Build and Deploy A2A Agents

10 min read
nanobot: Build AI Agents in 4,000 Lines You Can Actually Read

9 min read
MCP vs A2A vs Open Responses — AI Agent Communication Protocols in 2026: What to Actually Use

6 min read
Meta Llama Stack: Deploy Llama 4 With OpenAI-Compatible API

9 min read
DeepSeek V4-Pro and V4-Flash: Migration Guide and API Setup

11 min read
GPT-5.5 Spud: Unified Multimodal API — Developer Integration Guide

10 min read
>-

9 min read
Cursor 2.0: 8 Parallel AI Agents and Visual Editor Bridge

10 min read
Llama 4 Maverick: 400B MoE Model — Self-Hosting and API Guide

8 min read
Databricks Unity AI Gateway: MCP Agent Governance Guide

11 min read
Building a Claude Streaming Agent with Vercel AI SDK

9 min read
GitLab 18.11: Agentic AI for Security, CI, and Analytics

10 min read
Kimi Code K2.6: Moonshot AI's Coding Model vs Claude Code

11 min read
Qwen3.6-Plus: 1M Token Context and Claude-Level Performance

10 min read
Claude Code Routines Practical Guide — How to Automate AI Tasks 24/7 with Schedules, APIs, and GitHub Events

8 min read
LLM Inference Engines Compared 2026: vLLM vs SGLang vs TGI vs MAX

10 min read
smolagents: Build Code Agents with HF in Under 100 Lines

11 min read
OpenClaw: Self-Hosted AI Gateway for WhatsApp, Telegram & Discord

11 min read
MCP Server Kubernetes Deployment — Surviving the 52% Death Rate

8 min read