SQL Anti-Patterns You Should Avoid

https://datamethods.substack.com/p/sql-anti-patterns-you-should-avoid

41•zekrom•2h ago

Comments

jwsteigerwalt•30m ago

That’s my rap sheet…

JohnHaugeland•29m ago

these aren't anti-patterns. these are just things you shouldn't do

em500•26m ago

Still waiting for the definitive article on why using the term anti-pattern is an anti-pattern.

readthenotes1•4m ago

If a pattern is a common problem (e.g., becoming accustomed to a spectacular view) and a generally useful solution to that problem (blocking the view so that effort is required to obtain it), then an anti-pattern is what?

I think most people think an anti-pattern is an aberration in the "solution" section that creates more problems.

So here, the anti-pattern is that people use a term so casually (e.g., DevOps) that no one knows what it's referring to anymore.

(The problem: need a way to refer to concept(s) in a pithy way. The solution: make up or reuse an existing word/phrase to incorporate the concept(s) by reference so that it can, unambiguously, be used as a replacement for the longer description.)

jacknews•26m ago

"When handling large CASE WHEN statements, it is better to create a dimension table or view, ideally sourced from the landed table where the original status column is populated."

Is this code for 'use a lookup table' or am I falling behind on the terminology? The modern term should be 'sum table' or something similar surely.

LikesPwsh•12m ago

"Dimension table" is the name for lookup tables in a star or snowflake schema.

jacknews•3m ago

Thanks.

'Landed table'?

parpfish•2m ago

but sometimes large case statements can't be turned into a simple dimension table/lookup table because it's not a simple key-value transformation.

if your case statement is just a series of straight-ahead "WHEN x=this THEN that", you're very lucky.

the nasty case statements are the ones where the WHEN expression sometimes uses different pieces of data and/or the ordering of the statements is important.
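A minimal sketch of the "nasty" kind of CASE described above, run through Python's built-in sqlite3 module. The orders table, its columns, and the thresholds are hypothetical. Each branch looks at a different column, and a row can match more than one branch, so the result depends on branch order and cannot be expressed as a single-key lookup.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Hypothetical table, for illustration only.
    CREATE TABLE orders (
        order_id  INTEGER PRIMARY KEY,
        status    TEXT,
        days_late INTEGER,
        amount    REAL
    );
    INSERT INTO orders VALUES
        (1, 'cancelled', 5,  500.0),
        (2, 'open',      5,   50.0),
        (3, 'open',      0, 5000.0);
""")

# Branches test different columns; the first matching branch wins.
rows = conn.execute("""
    SELECT order_id,
           CASE
               WHEN status = 'cancelled' THEN 'ignore'
               WHEN days_late > 3        THEN 'escalate'
               WHEN amount > 1000        THEN 'review'
               ELSE 'ok'
           END AS action
    FROM orders
    ORDER BY order_id
""").fetchall()

# Same branches with the first two swapped: order 1 now gets a different action.
rows_swapped = conn.execute("""
    SELECT order_id,
           CASE
               WHEN days_late > 3        THEN 'escalate'
               WHEN status = 'cancelled' THEN 'ignore'
               WHEN amount > 1000        THEN 'review'
               ELSE 'ok'
           END AS action
    FROM orders
    ORDER BY order_id
""").fetchall()

print(rows)
print(rows_swapped)
```

Order 1 is both cancelled and late, so whichever branch comes first decides its action.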

jasonpbecker•25m ago

We did the views-on-views thing once when triggers, at least as we implemented them, failed. This became a huge regret that we lived with for years and not-so-affectionately called "view mountain". We finally slayed view mountain over the last 2 years and it feels so good.

chongli•24m ago

When working with larger enterprise software, it is common to have large CASE WHEN statements translating application status codes into plain English. For example, status code 1 could mean the item is out of stock.

Why wouldn’t you store this information in a table and query it when you need it? What if you need to support other languages? With a table you can just add more columns for more languages!
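A minimal sketch of the table-based approach suggested above, using Python's built-in sqlite3 module. Only "status code 1 means out of stock" comes from the quoted article; the table names, the second status code, and the German labels are assumptions made for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Hypothetical lookup (dimension) table: one row per status code,
    -- one column per language.
    CREATE TABLE status_dim (
        status_code INTEGER PRIMARY KEY,
        label_en    TEXT NOT NULL,
        label_de    TEXT NOT NULL
    );
    INSERT INTO status_dim VALUES
        (1, 'Out of stock', 'Nicht auf Lager'),
        (2, 'In stock',     'Auf Lager');

    CREATE TABLE items (
        item_id     INTEGER PRIMARY KEY,
        status_code INTEGER REFERENCES status_dim (status_code)
    );
    INSERT INTO items VALUES (10, 1), (11, 2), (12, 1);
""")

# The translation lives in data, so the query needs no CASE WHEN at all.
labels_en = conn.execute("""
    SELECT i.item_id, d.label_en
    FROM items AS i
    JOIN status_dim AS d ON d.status_code = i.status_code
    ORDER BY i.item_id
""").fetchall()

print(labels_en)
```

Adding a status or fixing a label then becomes an INSERT or UPDATE on status_dim rather than an edit to every query that repeats the CASE expression.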

megaman821•10m ago

I usually use generated columns for this. It still uses CASE WHEN but it is obvious to all consumers of the table that it exists.
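A rough sketch of the generated-column approach, assuming SQLite 3.31 or newer (the release that added generated columns) behind Python's sqlite3 module. The table, the column names, and the labels other than "out of stock" are hypothetical, and other databases spell generated columns slightly differently.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Hypothetical table. The CASE WHEN still exists, but it is declared
    -- once on the table instead of being repeated in every query.
    CREATE TABLE items (
        item_id      INTEGER PRIMARY KEY,
        status_code  INTEGER NOT NULL,
        status_label TEXT GENERATED ALWAYS AS (
            CASE status_code
                WHEN 1 THEN 'Out of stock'
                WHEN 2 THEN 'In stock'
                ELSE 'Unknown'
            END
        ) VIRTUAL
    );
    -- Generated columns cannot be written to, so list the real columns.
    INSERT INTO items (item_id, status_code) VALUES (10, 1), (11, 2), (12, 9);
""")

labels = conn.execute(
    "SELECT item_id, status_label FROM items ORDER BY item_id"
).fetchall()

print(labels)
```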

anthonyIPH•24m ago

"Instead you should:

query WHERE name = 'abc'

create an indexed UPPER(name) column"

Should there be an "or" between these 2 points, or am I missing something? Why create an UPPER index column and not use it?

wmonk•18m ago

The section on using functions on indexed columns could do with a more explicit and deeper explanation. When you wrap an indexed column in a function, the query becomes a full scan of the data, because the query runner has to run the function on every row, effectively removing any benefit of the index.

Unfortunately I learned this the hard way!
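The effect described above can be observed with EXPLAIN QUERY PLAN. This is a sketch against SQLite via Python's sqlite3 module; the users table and index names are hypothetical, the exact plan wording varies between SQLite versions, and other databases have their own planners and their own syntax for expression indexes. It also suggests that the article's two bullet points are alternatives: either compare the bare column, or index the expression and query through that same expression.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Hypothetical table with an ordinary index on name.
    CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT, email TEXT);
    CREATE INDEX idx_users_name ON users (name);
""")

def plan(sql):
    """Return the 'detail' column of EXPLAIN QUERY PLAN for a statement."""
    return [row[3] for row in conn.execute("EXPLAIN QUERY PLAN " + sql)]

plain = "SELECT id, email FROM users WHERE name = 'abc'"
wrapped = "SELECT id, email FROM users WHERE UPPER(name) = 'ABC'"

# Bare column: the planner can search idx_users_name.
plan_plain = plan(plain)

# Column wrapped in a function: the plain index no longer applies,
# so the planner falls back to scanning the table.
plan_wrapped_before = plan(wrapped)

# Index the expression itself; the same wrapped query can now search it.
conn.execute("CREATE INDEX idx_users_upper_name ON users (UPPER(name))")
plan_wrapped_after = plan(wrapped)

for label, p in [("plain", plan_plain),
                 ("wrapped, no expression index", plan_wrapped_before),
                 ("wrapped, expression index", plan_wrapped_after)]:
    print(label, "->", p)
```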

LikesPwsh•15m ago

Some well-known docs on the topic: https://use-the-index-luke.com/sql/where-clause/obfuscation

readthenotes1•13m ago

"Unfortunately I learned this the hard way!" ... Seems to be the motto of SQL developers.

OTOH, it seems a fairly stable language (family of dialects?), so finding the pitfalls has long leverage.

EvanAnderson•8m ago

> Overusing DISTINCT to “Fix” Duplicates

Any time I see DISTINCT in a query I immediately become suspicious that the query author has an incomplete understanding of the data model, a lack of comprehension of set theory, or more likely both.

Sesse__•4m ago

Or just doesn't know how to do semijoins in SQL, since they don't follow the same syntax as normal joins for whatever historical reason.
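A small sketch of the semijoin point, using Python's sqlite3 module with hypothetical customers and orders tables. The question is "which customers have at least one order"; a regular join repeats a customer once per order, which is where the reflexive DISTINCT tends to come from, while EXISTS asks the semijoin question directly.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Hypothetical tables: Ada has two orders, Grace one, Linus none.
    CREATE TABLE customers (customer_id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE orders (order_id INTEGER PRIMARY KEY, customer_id INTEGER);
    INSERT INTO customers VALUES (1, 'Ada'), (2, 'Grace'), (3, 'Linus');
    INSERT INTO orders VALUES (100, 1), (101, 1), (102, 2);
""")

# Regular join: one output row per matching order, so Ada appears twice.
joined = [r[0] for r in conn.execute("""
    SELECT c.name
    FROM customers AS c
    JOIN orders AS o ON o.customer_id = c.customer_id
    ORDER BY c.customer_id
""")]

# Semijoin via EXISTS: each customer appears at most once, no DISTINCT needed.
semijoined = [r[0] for r in conn.execute("""
    SELECT c.name
    FROM customers AS c
    WHERE EXISTS (
        SELECT 1 FROM orders AS o WHERE o.customer_id = c.customer_id
    )
    ORDER BY c.customer_id
""")]

print(joined)
print(semijoined)
```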

dgb23•2m ago

If "select *" breaks your code, then there's something wrong with your code. I think Rich Hickey talked about this. Providing more than is needed should never be a breaking change.

Certain languages, formats and tools do this correctly by default. For the others you need a source of truth that you generate from.
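One way to read the point above, sketched with Python's sqlite3 module and a hypothetical users table: a consumer that reads columns by name keeps working when a column is added, while a consumer that depends on column count or position breaks.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.row_factory = sqlite3.Row  # rows support access by column name
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT)")
conn.execute("INSERT INTO users VALUES (1, 'Ada')")

def name_by_key():
    """Consumer that takes only what it needs, by name."""
    row = conn.execute("SELECT * FROM users").fetchone()
    return row["name"]

def name_by_position():
    """Consumer that assumes the row has exactly two columns."""
    user_id, name = conn.execute("SELECT * FROM users").fetchone()
    return name

before = (name_by_key(), name_by_position())

# The schema grows: "providing more than is needed".
conn.execute("ALTER TABLE users ADD COLUMN email TEXT")

after_by_key = name_by_key()
try:
    name_by_position()
    positional_still_works = True
except ValueError:
    # Unpacking three values into two names fails.
    positional_still_works = False

print(before, after_by_key, positional_still_works)
```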
