How to randomly select rows in SQL?

Question

I am using MSSQL Server 2005. In my db, I have a table "customerNames" which has two columns "Id" and "Name" and approx. 1,000 results.

I am creating a functionality where I have to pick 5 customers randomly every time. Can anyone tell me how to create a query which will get random 5 rows (Id, and Name) every time when query is executed?

Random is not a common requirement for a Database, I was surprised to find a link for some SQL — Paxic, Commented Feb 24, 2009 at 6:20
Depends on how much randomness you want. See: msdn.microsoft.com/en-us/library/aa175776(SQL.80).aspx for comparison of NEW_ID versus RAND() — Shannon Severance, Commented Jul 30, 2009 at 23:36

Katherine Mejia-Guerra · Accepted Answer · 2017-01-13 21:19:04Z

832

SELECT TOP 5 Id, Name FROM customerNames
ORDER BY NEWID()

That said, everybody seems to come to this page for the more general answer to your question:

Selecting a random row in SQL

Select a random row with MySQL:

SELECT column FROM table
ORDER BY RAND()
LIMIT 1

Select a random row with PostgreSQL:

SELECT column FROM table
ORDER BY RANDOM()
LIMIT 1

Select a random row with Microsoft SQL Server:

SELECT TOP 1 column FROM table
ORDER BY NEWID()

Select a random row with IBM DB2

SELECT column, RAND() as IDX 
FROM table 
ORDER BY IDX FETCH FIRST 1 ROWS ONLY

Select a random record with Oracle:

SELECT column FROM
( SELECT column FROM table
ORDER BY dbms_random.value )
WHERE rownum = 1

Select a random row with sqlite:

SELECT column FROM table 
ORDER BY RANDOM() LIMIT 1

edited Jan 13, 2017 at 21:19

Katherine Mejia-Guerra

1181 silver badge6 bronze badges

answered Jul 30, 2009 at 23:28

Curtis Tasker

11.4k2 gold badges24 silver badges23 bronze badges

34

Does this become very expensive on large tables, where each row gets a random number, and then a large unindexed random number set is sorted?
– Andrey
Commented Apr 19, 2014 at 16:04
1

This is perhaps obvious to most people, but it wasn't obvious to me... the following query will not get a new random value for each row: update tbl_vouchers set tbl_UsersID = (select top(1) id from tbl_Users order by NEWID()) - edit: I can't get formatting to work in comments :(
– Mir
Commented Dec 10, 2015 at 18:35
Why does this fail on Google Cloud SQL? We only get partially random results. Nearly 80% of the time we get the same row back.
– Praxiteles
Commented Nov 25, 2016 at 22:38
8

Warning: For big databases this method will have a bad performance. Can you imagine the time it will take to generate a random value for each row if the database have a million of entry? You can have more information about and a better alternativ here.
– Francis Ngueukam
Commented Dec 15, 2016 at 9:33
Thanks for the solution. Just wanted to know if we can assign some kind of a declare variable to the value after the 'limit' keyword. I am trying to find solutions in bigquery but haven't had much luck yet.
– Ajay Kumar
Commented Jan 20, 2020 at 9:52

| Show 1 more comment

Cody Caughlan · Accepted Answer · 2009-02-24 06:21:37Z

39

SELECT TOP 5 Id, Name FROM customerNames ORDER BY NEWID()

answered Feb 24, 2009 at 6:21

Cody Caughlan

32.7k5 gold badges65 silver badges68 bronze badges

Add a comment |

Barry Brown · Accepted Answer · 2009-02-24 06:45:56Z

16

In case someone wants a PostgreSQL solution:

select id, name
from customer
order by random()
limit 5;

answered Feb 24, 2009 at 6:45

Barry Brown

20.5k15 gold badges70 silver badges106 bronze badges

Add a comment |

TylerH · Accepted Answer · 2020-05-04 20:16:16Z

13

I have found this to work best for big data.

SELECT TOP 1 Column_Name FROM dbo.Table TABLESAMPLE(1 PERCENT);

TABLESAMPLE(n ROWS) or TABLESAMPLE(n PERCENT) is random but need to add the TOP n to get the correct sample size.

Using NEWID() is very slow on large tables.

edited May 4, 2020 at 20:16

TylerH

21.1k72 gold badges78 silver badges105 bronze badges

answered Aug 15, 2013 at 23:08

Billy

3613 silver badges5 bronze badges

Works well, thought I'd post a link to MS documentation on tablesample clause for people interested in what it does: learn.microsoft.com/en-us/azure/databricks/sql/language-manual/…
– Luke Alderton
Commented Jul 9 at 8:34

Add a comment |

user60456user60456 · Accepted Answer · 2009-02-24 06:21:20Z

11

Maybe this site will be of assistance.

For those who don't want to click through:

SELECT TOP 1 column FROM table
ORDER BY NEWID()

answered Feb 24, 2009 at 6:21

user60456

4

should have at least replaced 1 with 5 :)
– roman m
Commented Feb 24, 2009 at 6:40

Add a comment |

JohnC · Accepted Answer · 2012-09-23 12:17:51Z

8

There is a nice Microsoft SQL Server 2005 specific solution here. Deals with the problem where you are working with a large result set (not the question I know).

Selecting Rows Randomly from a Large Table http://msdn.microsoft.com/en-us/library/cc441928.aspx

answered Sep 23, 2012 at 12:17

JohnC

3,0871 gold badge25 silver badges33 bronze badges

Add a comment |

Protiguous · Accepted Answer · 2019-12-27 00:36:01Z

This is an old question, but attempting to apply a new field (either NEWID() or ORDER BY rand()) to a table with a large number of rows would be prohibitively expensive. If you have incremental, unique IDs (and do not have any holes) it will be more efficient to calculate the X # of IDs to be selected instead of applying a GUID or similar to every single row and then taking the top X # of.

DECLARE @minValue int;
DECLARE @maxValue int;
SELECT @minValue = min(id), @maxValue = max(id) from [TABLE];

DECLARE @randomId1 int, @randomId2 int, @randomId3 int, @randomId4 int, @randomId5 int
SET @randomId1 = ((@maxValue + 1) - @minValue) * Rand() + @minValue
SET @randomId2 = ((@maxValue + 1) - @minValue) * Rand() + @minValue
SET @randomId3 = ((@maxValue + 1) - @minValue) * Rand() + @minValue
SET @randomId4 = ((@maxValue + 1) - @minValue) * Rand() + @minValue
SET @randomId5 = ((@maxValue + 1) - @minValue) * Rand() + @minValue

--select @maxValue as MaxValue, @minValue as MinValue
--  , @randomId1 as SelectedId1
--  , @randomId2 as SelectedId2
--  , @randomId3 as SelectedId3
--  , @randomId4 as SelectedId4
--  , @randomId5 as SelectedId5

select * from [TABLE] el
where el.id in (@randomId1, @randomId2, @randomId3, @randomId4, @randomId5)

If you wanted to select many more rows I would look into populating a #tempTable with an ID and a bunch of rand() values then using each rand() value to scale to the min-max values. That way you do not have to define all of the @randomId1...n parameters. I've included an example below using a CTE to populate the initial table.

DECLARE @NumItems int = 100;

DECLARE @minValue int;
DECLARE @maxValue int;
SELECT @minValue = min(id), @maxValue = max(id) from [TABLE];
DECLARE @range int = @maxValue+1 - @minValue;

with cte (n) as (
   select 1 union all
   select n+1 from cte
   where n < @NumItems
)
select cast( @range * rand(cast(newid() as varbinary(100))) + @minValue as int) tp
into #Nt
from cte;

select * from #Nt ntt
inner join [TABLE] i on i.id = ntt.tp;

drop table #Nt;

@Protiguous, the edit you proposed broke the random selection. Using min() and max() applied to the dbo.Tally64k table would not allow the user to select a row with a pk id > 65556. — RIanGillis, Commented Sep 23, 2019 at 13:41
The table name change was simply an artifact from testing. The actual table name doesn't matter, as long as the correct table is used. min() and max() can both be queried in one query rather than two, which is what I was trying to show. — Protiguous, Commented Jan 14, 2020 at 21:48
@Protiguous Ah, I see that now, I was confused because you used the 0-65k when doing the min-max but not later. After your most recent edit I actually wanted to ask you about the performance implications of the changes you made, as performance tuning is one of my interests and seemingly meaningless decisions like which side of the equals sign you place something can actually have a significant impact --- Would the same thing apply to the 5 SET @randomId## calls? Or is that different because it is not SELECTing FROM an actual table? — RIanGillis, Commented Jan 15, 2020 at 4:49
I'm not sure I understand your question. Are you asking why there are 5 SET instead of just 1 SELECT @id1=rand(), @id2=rand().. ? It's because multiple calls to a rand() in 1 statement will produce the same result, hence the separated SET. (rand() on SQL Server is a deterministic function, I believe.) I would guess that 1 select vs 5 set is in the nanosecond range performance-wise. — Protiguous, Commented Mar 6, 2020 at 23:43

Tohid · Accepted Answer · 2018-03-03 22:14:37Z

6

If you have a table with millions of rows and care about the performance, this could be a better answer:

SELECT * FROM Table1
WHERE (ABS(CAST(
  (BINARY_CHECKSUM
  (keycol1, NEWID())) as int))
  % 100) < 10

https://msdn.microsoft.com/en-us/library/cc441928.aspx

answered Mar 3, 2018 at 22:14

Tohid

6,4918 gold badges54 silver badges81 bronze badges

Note that this will select approximately 10% of the rows in the table. If you need to select an exact number of rows, or at least N rows, this approach won't work.
– LarsH
Commented Jun 3, 2019 at 14:49

Add a comment |

Pang · Accepted Answer · 2020-03-25 23:59:55Z

6

SELECT * FROM TABLENAME ORDER BY random() LIMIT 5;

edited Mar 25, 2020 at 23:59

Pang

9,932146 gold badges85 silver badges124 bronze badges

answered Feb 25, 2015 at 7:49

Narendra

96715 silver badges29 bronze badges

Add a comment |

Vlad Mihalcea · Accepted Answer · 2021-01-20 19:56:40Z

In order to shuffle the SQL result set, you need to use a database-specific function call.

Note that sorting a large result set using a RANDOM function might turn out to be very slow, so make sure you do that on small result sets.

If you have to shuffle a large result set and limit it afterward, then it's better to use something like the Oracle SAMPLE(N) or the TABLESAMPLE in SQL Server or PostgreSQL instead of a random function in the ORDER BY clause.

So, assuming we have the following database table:

And the following rows in the song table:

| id | artist                          | title                              |
|----|---------------------------------|------------------------------------|
| 1  | Miyagi & Эндшпиль ft. Рем Дигга | I Got Love                         |
| 2  | HAIM                            | Don't Save Me (Cyril Hahn Remix)   |
| 3  | 2Pac ft. DMX                    | Rise Of A Champion (GalilHD Remix) |
| 4  | Ed Sheeran & Passenger          | No Diggity (Kygo Remix)            |
| 5  | JP Cooper ft. Mali-Koa          | All This Love                      |

Oracle

On Oracle, you need to use the DBMS_RANDOM.VALUE function, as illustrated by the following example:

SELECT
    artist||' - '||title AS song
FROM song
ORDER BY DBMS_RANDOM.VALUE

When running the aforementioned SQL query on Oracle, we are going to get the following result set:

| song                                              |
|---------------------------------------------------|
| JP Cooper ft. Mali-Koa - All This Love            |
| 2Pac ft. DMX - Rise Of A Champion (GalilHD Remix) |
| HAIM - Don't Save Me (Cyril Hahn Remix)           |
| Ed Sheeran & Passenger - No Diggity (Kygo Remix)  |
| Miyagi & Эндшпиль ft. Рем Дигга - I Got Love      |

Notice that the songs are being listed in random order, thanks to the DBMS_RANDOM.VALUE function call used by the ORDER BY clause.

SQL Server

On SQL Server, you need to use the NEWID function, as illustrated by the following example:

SELECT
    CONCAT(CONCAT(artist, ' - '), title) AS song
FROM song
ORDER BY NEWID()

When running the aforementioned SQL query on SQL Server, we are going to get the following result set:

| song                                              |
|---------------------------------------------------|
| Miyagi & Эндшпиль ft. Рем Дигга - I Got Love      |
| JP Cooper ft. Mali-Koa - All This Love            |
| HAIM - Don't Save Me (Cyril Hahn Remix)           |
| Ed Sheeran & Passenger - No Diggity (Kygo Remix)  |
| 2Pac ft. DMX - Rise Of A Champion (GalilHD Remix) |

Notice that the songs are being listed in random order, thanks to the NEWID function call used by the ORDER BY clause.

PostgreSQL

On PostgreSQL, you need to use the random function, as illustrated by the following example:

SELECT
    artist||' - '||title AS song
FROM song
ORDER BY random()

When running the aforementioned SQL query on PostgreSQL, we are going to get the following result set:

| song                                              |
|---------------------------------------------------|
| 2Pac ft. DMX - Rise Of A Champion (GalilHD Remix) |
| JP Cooper ft. Mali-Koa - All This Love            |
| Ed Sheeran & Passenger - No Diggity (Kygo Remix)  |
| HAIM - Don't Save Me (Cyril Hahn Remix)           |
| Miyagi & Эндшпиль ft. Рем Дигга - I Got Love      |

Notice that the songs are being listed in random order, thanks to the random function call used by the ORDER BY clause.

MySQL

On MySQL, you need to use the RAND function, as illustrated by the following example:

SELECT
  CONCAT(CONCAT(artist, ' - '), title) AS song
FROM song
ORDER BY RAND()

When running the aforementioned SQL query on MySQL, we are going to get the following result set:

| song                                              |
|---------------------------------------------------|
| HAIM - Don't Save Me (Cyril Hahn Remix)           |
| Ed Sheeran & Passenger - No Diggity (Kygo Remix)  |
| Miyagi & Эндшпиль ft. Рем Дигга - I Got Love      |
| 2Pac ft. DMX - Rise Of A Champion (GalilHD Remix) |
| JP Cooper ft. Mali-Koa - All This Love            |

Notice that the songs are being listed in random order, thanks to the RAND function call used by the ORDER BY clause.

Palash Mondal · Accepted Answer · 2020-02-12 07:37:16Z

1

If you are using large table and want to access of 10 percent of the data then run this following command: SELECT TOP 10 PERCENT * FROM Table1 ORDER BY NEWID();

answered Feb 12, 2020 at 7:37

Palash Mondal

5264 silver badges10 bronze badges

Add a comment |

John · Accepted Answer · 2021-03-01 06:50:25Z

0

If you use Yandex Database then you should use

select column from table order by random (TableRow()) limit 1;

answered Mar 1, 2021 at 6:50

John

4617 silver badges22 bronze badges

Add a comment |

Muzib · Accepted Answer · 2022-06-06 16:20:52Z

0

If you don't want to use NEWID() and the primary key column is int, then you can just select a random primary key like this:

with a as 
(
select count(id) as row_count
from mytable
)

select *
from mytable , a
where id = round(rand() * row_count, 0)

answered Jun 6, 2022 at 16:20

Muzib

2,5723 gold badges23 silver badges33 bronze badges

Add a comment |

Yura · Accepted Answer · 2023-07-20 14:55:58Z

0

If you need just to shuffle sequential values then you don't need always to use random (as it's non sql standard), you can try to use some tricks, like using reverse(PK)

SELECT PK FROM products ORDER BY REVERSE(concat('', PK))

So let's say we have a values: 123 124 125 223 224 225 323 324 325

then after reverse we will see them in the following order:

answered Jul 20, 2023 at 14:55

Yura

1,8031 gold badge21 silver badges19 bronze badges

Add a comment |

Collectives™ on Stack Overflow

How to randomly select rows in SQL?

14 Answers 14

Selecting a random row in SQL

Select a random row with MySQL:

Select a random row with PostgreSQL:

Select a random row with Microsoft SQL Server:

Select a random row with IBM DB2

Select a random record with Oracle:

Select a random row with sqlite:

Oracle

SQL Server

PostgreSQL

MySQL

Not the answer you're looking for? Browse other questions tagged
sql
database
random
or ask your own question.

Linked

Hot Network Questions

Collectives™ on Stack Overflow

14 Answers 14

Select a random row with MySQL:

Select a random row with PostgreSQL:

Select a random row with Microsoft SQL Server:

Select a random row with IBM DB2

Select a random record with Oracle:

Select a random row with sqlite:

Oracle

SQL Server

PostgreSQL

MySQL

Not the answer you're looking for? Browse other questions tagged sqldatabaserandom or ask your own question.

Linked

Related

Not the answer you're looking for? Browse other questions tagged
sql
database
random
or ask your own question.