DEV Community

# apachespark

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
How I debugged a Delta Lake DESCRIBE HISTORY timeout (and what's actually causing it)

How I debugged a Delta Lake DESCRIBE HISTORY timeout (and what's actually causing it)

Comments
4 min read
Your Customer Table Has Duplicates You Can't See With SQL How I Built a Cross-Platform Identity Resolution Layer for a Dark Kitchen Data Platform

Your Customer Table Has Duplicates You Can't See With SQL How I Built a Cross-Platform Identity Resolution Layer for a Dark Kitchen Data Platform

3
Comments
8 min read
🚀 Apache Spark Just Killed the Microbatch Barrier (And Why Flink Should Be Worried)

🚀 Apache Spark Just Killed the Microbatch Barrier (And Why Flink Should Be Worried)

1
Comments
3 min read
Should you join Data Engineering?A guide to the tools you'll use

Should you join Data Engineering?A guide to the tools you'll use

10
Comments
2 min read
Using Gravitino with Apache Spark for ETL

Using Gravitino with Apache Spark for ETL

Comments
7 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.