DEV Community

Siyana Hristova profile picture

Siyana Hristova

Data, culture, architecture, solopreneurship. Creator of Similarity API. Check it out at https://similarity-api.com/

Location Los Angeles, CA Joined Joined on  Personal website https://similarity-api.com/
How to Fuzzy-Match 1 Million Rows in BigQuery in under 10 minutes

How to Fuzzy-Match 1 Million Rows in BigQuery in under 10 minutes

Comments
4 min read
How to Reconcile Salesforce Leads Against Contacts at Scale

How to Reconcile Salesforce Leads Against Contacts at Scale

Comments
3 min read
How to fuzzy-match a 1M-row dataset to a canonical reference in under 10 minutes (2026 guide)

How to fuzzy-match a 1M-row dataset to a canonical reference in under 10 minutes (2026 guide)

Comments
3 min read
Why It Rarely Makes Sense to Build Fuzzy Matching Yourself in 2026

Why It Rarely Makes Sense to Build Fuzzy Matching Yourself in 2026

1
Comments
2 min read
Fuzzy-match 1M rows in under 10 minutes (2026 Edition)

Fuzzy-match 1M rows in under 10 minutes (2026 Edition)

2
Comments
3 min read
Fuzzy-match millions of rows in Databricks (2026)

Fuzzy-match millions of rows in Databricks (2026)

9
Comments
5 min read
Scaling Fuzzy Matching: From Local Scripts to Production Pipelines

Scaling Fuzzy Matching: From Local Scripts to Production Pipelines

7
Comments
5 min read
I built a fuzzy matching engine that's 300x faster than RapidFuzz on 1M records

I built a fuzzy matching engine that's 300x faster than RapidFuzz on 1M records

1
Comments
2 min read
loading...