Can JOINs cause duplicate rows in results? How do you eliminate them?

How JOINs Can Produce Duplicate Rows and How to Remove Them

Yes, JOINs can cause duplicate rows when the relationship between tables is not strictly one-to-one. Any one-to-many or many-to-many relationship will multiply rows, which may appear as duplicates in the result set—especially when joining tables like customers → orders → order_items.

Why JOINs Cause Duplicates

One-to-many relationships expand rows (e.g., one order with multiple payments).
Duplicate matching values in the joined table multiply rows.
Lack of proper join conditions results in unintended row combinations.
Joining denormalized tables may bring repeated values that look like duplicates.

Example: JOIN that Produces Duplicates

If a customer has 3 orders, they appear 3 times. These aren't true "duplicates"—they're correct representations of a one-to-many relationship. But sometimes duplicates are accidental and need removal.

How to Eliminate Duplicate Rows

Use DISTINCT to return only unique combinations of selected columns.
Use GROUP BY to collapse rows (only when grouping makes sense).
Refine JOIN conditions to avoid unintended matches.
Normalize data to prevent duplicate stored values.
Use EXISTS instead of JOIN when only existence needs to be checked.

Using DISTINCT to Remove Duplicates

Using EXISTS to Avoid Join Multiplication

EXISTS is often faster and avoids row-multiplying JOINs when you're only checking whether related data exists.

Best Practices

Understand the relationship (one-to-one, one-to-many, many-to-many).
Use DISTINCT only when logically correct—it hides problems instead of fixing them.
Use GROUP BY when aggregating data, not just to remove duplicates.
Avoid unnecessary JOINs when simpler EXISTS queries suffice.

Question Loading...

Functions and Operators

Keys

Joins

Triggers