DEV Community

soy profile picture

soy

Patent lawyer turned AI engineer. Processed 4M patents with local LLM on RTX 5090. Building PatentLLM — AI-powered patent search. Also ranked #1 on Floodgate (shogi AI). Writing about local LLM etc.

I Built a Free Patent Search Engine with 3.5M US Patents — No Login, Powered by SQLite FTS5

I Built a Free Patent Search Engine with 3.5M US Patents — No Login, Powered by SQLite FTS5

Comments 1
3 min read

Want to connect with soy?

Create an account to connect with soy. You can also sign in below to proceed if you already have an account.

Already have an account? Sign in
Operational Techniques for Automatically Starting vLLM, Flask, and cron with systemd Services in WSL2

Operational Techniques for Automatically Starting vLLM, Flask, and cron with systemd Services in WSL2

Comments
3 min read
Achieving Bidirectional Integration of Streamlit Backend Flutter Frontend in a WSL2 Environment

Achieving Bidirectional Integration of Streamlit Backend Flutter Frontend in a WSL2 Environment

Comments
2 min read
A Regulatory Analysis Dashboard for Fast Searching NITE CHRIP Data using FTS5

A Regulatory Analysis Dashboard for Fast Searching NITE CHRIP Data using FTS5

Comments
2 min read
Searching Case Law PDFs with RAG — A Legal AI Search System using Gemini + SQLite FTS5

Searching Case Law PDFs with RAG — A Legal AI Search System using Gemini + SQLite FTS5

Comments
3 min read
google-generativeai google-genai Migration Guide

google-generativeai google-genai Migration Guide

Comments
2 min read
Gemini 2.5 Flash x Nemotron 9B — Optimal Division of Roles for Cloud LLM and Local LLM

Gemini 2.5 Flash x Nemotron 9B — Optimal Division of Roles for Cloud LLM and Local LLM

Comments
3 min read
Reduce API Costs for Large-Scale Document Analysis with Gemini Context Caching

Reduce API Costs for Large-Scale Document Analysis with Gemini Context Caching

Comments
2 min read
Skit: The Man Obsessed with Claude Code

Skit: The Man Obsessed with Claude Code

Comments
3 min read
Building a Free Research Agent with DuckDuckGo Search + Local LLM

Building a Free Research Agent with DuckDuckGo Search + Local LLM

Comments
2 min read
A Daily Report System to Automatically Aggregate Claude Code + Gemini CLI Usage History Every Morning with Cron

A Daily Report System to Automatically Aggregate Claude Code + Gemini CLI Usage History Every Morning with Cron

Comments
2 min read
Reducing Token Consumption in Claude Code — FTS5 Knowledge DB + Tiered Index Design

Reducing Token Consumption in Claude Code — FTS5 Knowledge DB + Tiered Index Design

Comments 1
2 min read
Implementing Stripe Checkout Billing in PatentLLM

Implementing Stripe Checkout Billing in PatentLLM

Comments
2 min read
Building a 5-in-1 App with Local LLM and Flutter

Building a 5-in-1 App with Local LLM and Flutter

Comments
2 min read
Leveraging Claude Code's MCP Server

Leveraging Claude Code's MCP Server

Comments 1
2 min read
LoRA and FT Are Unnecessary: How to Approach Distilled Models

LoRA and FT Are Unnecessary: How to Approach Distilled Models

Comments
2 min read
Lineage of OSS Supporting the AI Development Stack: Its Origins and Creators

Lineage of OSS Supporting the AI Development Stack: Its Origins and Creators

Comments
6 min read
Running NVIDIA Nemotron-Nano-9B-v2-Japanese Locally: Mamba SSM + Thinking Mode Support

Running NVIDIA Nemotron-Nano-9B-v2-Japanese Locally: Mamba SSM + Thinking Mode Support

Comments
2 min read
Strategic Data Organization Techniques Using SQLite, JSONL, XML, and TSV: Lessons

Strategic Data Organization Techniques Using SQLite, JSONL, XML, and TSV: Lessons

Comments
3 min read
Shogi AI with RTX 5090 — Record of TensorRT FP8 Quantization and Floodgate Practical Games

Shogi AI with RTX 5090 — Record of TensorRT FP8 Quantization and Floodgate Practical Games

Comments
2 min read
Practical Guide to Running Nemotron-Nano-9B-v2-Japanese with vLLM and Integrating it into Your Custom Application via an Open...

Practical Guide to Running Nemotron-Nano-9B-v2-Japanese with vLLM and Integrating it into Your Custom Application via an Open...

Comments
6 min read
Python Environment Management with uv: Introduction and Practical Use of a High-Speed Package Manager Replacing pip/venv

Python Environment Management with uv: Introduction and Practical Use of a High-Speed Package Manager Replacing pip/venv

Comments
3 min read
Automatically Prevent Port Conflicts and Dangerous Commands Proactively with Claude Code's Hooks Feature

Automatically Prevent Port Conflicts and Dangerous Commands Proactively with Claude Code's Hooks Feature

Comments 1
2 min read
Giving a 'Brain' to Minecraft NPCs with a Local LLM — Nemotron + Mineflayer Implementation Notes

Giving a 'Brain' to Minecraft NPCs with a Local LLM — Nemotron + Mineflayer Implementation Notes

Comments
3 min read
Exposing Multiple Web Applications from a Home Server with Cloudflare Tunnel + Caddy

Exposing Multiple Web Applications from a Home Server with Cloudflare Tunnel + Caddy

Comments
2 min read
Personal AI Development Environment Built with RTX 5090 + WSL2 — A Practical Setup Fully Utilizing 32GB GPU

Personal AI Development Environment Built with RTX 5090 + WSL2 — A Practical Setup Fully Utilizing 32GB GPU

Comments
2 min read
Individual Developer's Portfolio Strategy: Running 13 Projects on a Single RTX 5090

Individual Developer's Portfolio Strategy: Running 13 Projects on a Single RTX 5090

Comments
2 min read
Using Local LLMs as a "Batch Processing Engine" — A Design for Automatically Generating Artifacts from Your Own Data

Using Local LLMs as a "Batch Processing Engine" — A Design for Automatically Generating Artifacts from Your Own Data

Comments
10 min read
Fast Searching 4 Million Patent Records with FTS5

Fast Searching 4 Million Patent Records with FTS5

Comments
2 min read
loading...