42

Recently I learned about GraphQL which claims to be superior to RESTful. However, I started wondering why don't we simply put SQL statements into an HTTP GET request.

For example, in GraphQL I would write

{
  Movie(id: "cixos5gtq0ogi0126tvekxo27") {
    id
    title
    actors {
      name
    }
  }
}

Which isn't much simpler than its SQL counterpart

SELECT id, title FROM movies 
  WHERE id = cixos5gtq0ogi0126tvekxo27;

SELECT actors.name FROM actors, actors_movies 
  WHERE actors.id == movies.actor_id 
  AND movie.id == cixos5gtq0ogi0126tvekxo27;

Maybe we can URL-encode the query and send to the server

GET endpoint?q=SELECT%20id%2C%20title%20FROM%20movies%20WHERE%20id%20%3D%20cixos5gtq0ogi0126tvekxo27%3B%0ASELECT%20actors.name%20FROM%20actors%2C%20actors_movies%20WHERE%20actors.id%20%3D%3D%20movies.actor_id%20AND%20movie.id%20%3D%3D%20cixos5gtq0ogi0126tvekxo27%3B HTTP/1.1

Yes, the query URL can be too long, but you can put it into the body of a POST request if you don't care about REST compliance. (By the way, I think the HTTP RFC need be revised for REST to make sense: capping the length of query strings mixes implementation with specification at the very beginning)

Directly issuing SQL from the client also has the advantage of

  1. No server-side code/library is required to parse GraphQL, reducing development time.
  2. No server-side overhead is needed to parse GraphQL, reducing runtime.
  3. SQL statements are much more flexible than GraphQL because (in most cases) the latter will reduce to SQL anyway.
  4. Everyone knows SQL.

So, what the advantages GraphQL have over SQL?

25
  • 51
    Little Bobby Tables. Commented Apr 6, 2019 at 7:05
  • 5
    1. I can still DoS you with arbitrarily complicated SQL queries. 2. There's no chance a malicious actor would ever obtain a valid key... Commented Apr 6, 2019 at 7:16
  • 7
    @PhilipKendall You are right, but using GraphQL (or REST or whatever) doesn't solve these problems either, right?
    – nalzok
    Commented Apr 6, 2019 at 7:18
  • 12
    @nalzok: SQL is Turing-complete, which means it is impossible to validate statically. Commented Apr 6, 2019 at 7:45
  • 3
    This is very simple to understand why it's a terrible idea. Implement it yourself. At some point, you will realise that your are investing the time mostly in 1 thing: security. Not too later you will feel somewhat upset because you are implementing a caped TOAD. Then you will realise how hard is mapping rows all over the system and you will try to reinvent the ORM wheel on both sides: client and server. By the time you give up, your PM will ask you for report: how is the users' service going? Is it done?"...
    – Laiv
    Commented Apr 6, 2019 at 12:24

5 Answers 5

44

Basically, abstraction.

SQL requires your clients to know your exact database structure, which is not good. On top of that, analysing the SQL in order to perform special operations based on the value sent as the input is a really difficult thing to do. There are entire softwares which are pretty much responsible only for that. Do you know what those are? If you have guessed the databases, you are right.

Thanks to not exposing the SQL directly, you are not limiting the consumer of the API to the internal representation of your database. You easily expose only what you want to expose.

And since clients of the API depend only on the abstraction, you are free to have as many layers as possible between the API input and the actual database (security, caching, loading data from multiple databases on a single request,...).

For public services, exposing a database directly is pretty much never the right approach. If you however have a few internal systems, sure, your approach might make sense but even then it might just be easier to connect to application A's database directly from Application B by giving the database credentials to the Application B, rather than trying to come up with a custom HTTP interface for the database SQL language.


Why can't I just compare the URL (or SQL query) against keys in Redis before performing the actual query on the RDBMS?

Because it's not easy. Even if someone uses a very simple query, such as:

SELECT st.id, jt.name
FROM some_table st
INNER JOIN join_table jt ON jt.some_table_id = st.id
WHERE st.name = 'hello
world' AND st.type = 'STANDARD'

how do you make sure the result is properly cached? This query includes newlines, but someone could just as well write the query in the following way:

SELECT st.id, jt.name FROM some_table st INNER JOIN join_table jt ON jt.some_table_id = st.id WHERE st.name = 'hello
world' AND st.type = 'STANDARD'

and it's still supposed to be cached in the same way as the one above. I have specifically included a where in which a string search contains a new line, so simply finding line endings and replacing them with a space is not going to work here, parsing the request correctly would be much more complicated.

And even if you do fix that, another query could switch the order of conditions and the query would look like this:

SELECT st.id, jt.name
FROM some_table st
INNER JOIN join_table jt ON jt.some_table_id = st.id
WHERE st.type = 'STANDARD' AND st.name = 'hello
world'

and another request could contain a redundant WHERE argument, like this:

SELECT st.id, jt.name
FROM some_table st
INNER JOIN join_table jt ON jt.some_table_id = st.id
WHERE st.type = 'STANDARD' AND st.name = 'hello
world' AND st.stype = 'STANDARD'

All of those queries are still supposed to return the same result, should be cached in the same way. But handling all of the possible options is pretty much impossible. That's why you cannot simply compare the URL against keys in Redis.

8
  • This is a nice answer, but please see the update.
    – nalzok
    Commented Apr 7, 2019 at 4:16
  • 1
    how do you make sure the result is properly cached? parse it into an AST and hash that. Not that you would want to do that which the DBMS is probably already doing.
    – StingyJack
    Commented Dec 21, 2020 at 18:52
  • 2
    and GraphQL doesn't require you to know your GraphQL structure? Commented Dec 22, 2020 at 18:33
  • @user253751, what I meant by that sentence is the inversion: by exposing SQL directly, you're pretty much required to expose all your tables - which is detailed in the next paragraph. Having a control of your underlying layer without exposure is extremely important, because it makes refactoring much easier, since clients do not depend on the implementation (the DB structure), they rely on the abstraction you've provided. ACL is better implemented in the application layer, along with other mechanism which prevent unwanted access to the data.
    – Andy
    Commented Dec 22, 2020 at 19:08
  • @Andy views and permissions solve the basic table access and operation security problem. I think Ewan gives a better explanation, there are more precarious features like temporary tales, etc, which are hard to reason about the consequences of. Commented Apr 26, 2021 at 17:59
32

In theory there is no reason you can't expose an SQL interface like this.

In practice SQL is far too powerful to be effectively limited to the security scope you want to expose.

Even if you allow only read access, a bad query can still hog resources.

Other languages such as graphQL are designed to be exposed. They are merely giving users a filter option on what they could already see.

The benefit of using these languages is that they have gone through all the things you would want to stop users doing in SQL and taken them off the table.

10
  • 13
    Thanks for the answer, but could you explain how GraphQL solve the problem of resource draining? A rogue GraphQL query can still say “tell me everything about each movie and their actors“, resulting in a huge graph, and exhausting my DBMS and network.
    – nalzok
    Commented Apr 7, 2019 at 3:14
  • 8
    the problem is not so much restricting access to tables, or deleting, but the shear complexity of SQL. will you allow temp table creation? what about executing CLI? loops? transactions? sub selects? cursors? how will you distinguish when the use of these things is acceptable and when its 'bad'
    – Ewan
    Commented Apr 7, 2019 at 11:38
  • 3
    @Ewan, if there was a serious attempt to restrict the power of SQL, wouldn't that be done by implementing a parser for SQL and attaching additional security controls, rather than implementing a(nother) special query language whose only selling point is that it is less powerful?
    – Steve
    Commented Dec 22, 2020 at 8:00
  • 1
    tricky if you have single commands that do both allowed and non-allowed operations. It's easier to create a new language from the ground up
    – Ewan
    Commented Dec 22, 2020 at 8:38
  • 1
    I should note, that some APIs do in fact implement a limited SQL subset for use over APIs. I don't necessarily trust them, but I must assume that whoever is implementing them must know what they are doing. Big companies and all that.
    – Shayne
    Commented May 27, 2022 at 3:01
10

Because almost nobody understand that SQL != RDBMS != Relational model.

SQL is just a query language. It does not even need a relational database system. It is not fully relational either.

It also neither demand that is implemented any more fancy that simple CRUD, and not even need to implement CRUD. It happily is used as just read-only query data back pls.

Is just an (happy) accident that SQL + RDBMS is such a powerful combo that both become conflated. Also, that SQL is standardized across so many RDBMS that some of the capabilities of them surface, meaning that it can look more complex that something anemic like GraphQL or Rest can even dream.

Also, that is extended to be crud + schema manipulation + scripting + anything anybody wants or needs.

SQL is THAT FLEXIBLE.

So, is totally valid to put SQL on top of HTTP and that not means that MUST be directly feed to the database. Can be, totally, alike GraphQL and have a in-memory resolver that do whatever it need to be transformed, later, to the "real" SQL of the RDBMS or ANYTHING ELSE, because is part of the relational model foundational designs to be a abstraction over storage and query.

That is why, you use SQL (tables) on top of Trees (B-Tree) or arrays (Columnar) or Graphs, or files, or whatever on the sun.

1
  • this guy gets it. yours is the best answer here. i'm surprised how stuck everyone is in their thinking
    – symbiont
    Commented Mar 13 at 21:54
6

As others have mentioned, exposing SQL directly in the api is a very bad option. GraphQL, despite it's name, is not an abstraction for SQL, but for any data store or even other services.

If you are looking for an abstraction that is closer to SQL, you might want to have a look at odata (if you happen to work in .NET backends, though maybe other implementations exist).

1
  • This is another cult-following answer. There is no reason why SQL databases can't have imported tables and extensions.
    – Asclepius
    Commented Aug 7, 2022 at 21:53
0

if you want expose SQL like GraphQL, you will could need something like GraphQL, because you will need hide the important information and select what you want to show in the API, this for security.

GraphQl and SQL are different things, SQL is the language to query DataBase and GraphQL is only to manage the data from API,in API you will need make yours schemas to show and querys to manage it, etc.

in any API you will need make those things for simply security, but if you want something that is free data access maybe it would work, you know so many alternatives in the software world

3
  • 1
    Nonsense, because you can hide using SQL database views (and access restrictions) just the same.
    – Asclepius
    Commented Aug 7, 2022 at 21:55
  • @Asclepius you're right, that it's an error for my young version hahaha, views can work with the acesss of the data, thanks for show it
    – Black Hole
    Commented Aug 8, 2022 at 21:12
  • @Asclepius anyways that wasn't the point, was about the interface that GraphQL bring to you, this make you able to make querys from frontend and give to you very flexibility in an API. and with SQL you gonna need make the especific controllers and querys, and of course you can do the same (with UrlQueryParams) but graphQL does very well in this case.
    – Black Hole
    Commented Aug 8, 2022 at 21:16

Not the answer you're looking for? Browse other questions tagged or ask your own question.