NoSQL - what's that

1. NoSQL – What’s that? Sergejus Barinovas | Microsoft MVP | @sergejusb, sergejus.blogas.lt

2. NoSQL

3. WHY?

4. NoSQL – Why? Limited SQL scalability: horizontal partitioning (sharding), vertical partitioning
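Horizontal partitioning (sharding) can be illustrated by routing each row key to one of several servers. The following is a minimal sketch, assuming simple hash-modulo routing; the slides do not name a routing scheme, and `SHARDS`, `shard_for` and `partition` are hypothetical names used only for illustration.

```python
import hashlib

# Hypothetical server names; a real deployment would hold connection details.
SHARDS = ["db0", "db1", "db2"]


def shard_for(key: str, shards=SHARDS) -> str:
    """Pick a shard by hashing the key (stable across processes, unlike hash())."""
    digest = hashlib.md5(key.encode("utf-8")).digest()
    return shards[int.from_bytes(digest[:8], "big") % len(shards)]


def partition(keys, shards=SHARDS):
    """Group keys by the shard that would store them."""
    placement = {name: [] for name in shards}
    for key in keys:
        placement[shard_for(key, shards)].append(key)
    return placement
```

Hash-modulo routing is the simplest choice, but changing the number of shards moves most keys; that cost is one reason sharding a relational database by hand is painful.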

5. NoSQL – Why? Limited SQL availability: master / slave configuration

6. NoSQL – Why? SQL limitations for storing huge amounts of data: key / value / type columns

7. NoSQL – Why? Limited SQL speed of read/write operations: multiple read replicas

8. NoSQL History: 2009, Eric Evans

9. NoSQL – open source distributed databases, not relational SQL databases

10. NoSQL – not only SQL

11. NoSQL -> Big Data

12. NoSQL Characteristics (scalability): the ability to horizontally scale simple-operation throughput over many servers

13. NoSQL Characteristics (BASE): a “weaker” concurrency model than the ACID transactions in most SQL systems

14. NoSQL Characteristics (distributed): efficient use of distributed indexes and RAM for data storage

15. NoSQL Characteristics (schema-less): the ability to dynamically define new attributes or data schema

16. ACID (transactions): Atomicity – all or nothing

17. Consistency – state integrity

18. Isolation – no reads of uncommitted data

19. Durability – recover committed transactions

20. CAP Theorem (2000, Eric Brewer): it is impossible for a distributed computer system to simultaneously provide all three of the following guarantees: Consistency

21. Availability

22. Partition tolerance

23. BASE (eventual consistency): Basically Available – partial system failures are OK

24. Soft state – inconsistency is OK

25. Eventual consistency – stale data is OK
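The "stale data is OK" idea can be sketched with two replicas that exchange updates later. This is only an illustration, assuming last-write-wins conflict resolution by timestamp; the slides do not prescribe a resolution strategy, and the `Replica` class is hypothetical.

```python
class Replica:
    """One copy of the data; replicas converge only after they synchronize."""

    def __init__(self):
        self.data = {}  # key -> (timestamp, value)

    def write(self, key, value, timestamp):
        # Assumption: last write wins, decided by the larger timestamp.
        current = self.data.get(key)
        if current is None or timestamp > current[0]:
            self.data[key] = (timestamp, value)

    def read(self, key):
        entry = self.data.get(key)
        return None if entry is None else entry[1]

    def sync_from(self, other):
        """Pull the other replica's entries; newer timestamps overwrite older ones."""
        for key, (timestamp, value) in other.data.items():
            self.write(key, value, timestamp)


a, b = Replica(), Replica()
a.write("current_date", "2011.01.16", timestamp=1)
stale = b.read("current_date")   # b has not heard about the write yet
b.sync_from(a)
fresh = b.read("current_date")   # after synchronization both replicas agree
```

Between the write and the synchronization, a read from `b` returns old (here: missing) data, which is the soft state that BASE accepts in exchange for availability.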

27. NoSQL Databases

28. NoSQL Categories: Key / value store

29. Document database

30. Graph database

31. Columnar database

32. Key / value store: <key, value> or Tuple<key, v1, ..., vn>

33. Key / value store: simple operations (Get, Put, Delete); Key: Byte[], Value: Byte[]

34. Key / value store (example, Key -> Value): “current_date” -> 2011.01.16; “sergejusb” -> Binary Object; “sergejusb” -> JSON Object
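The three operations above can be sketched as an in-memory store that treats keys and values as opaque bytes. This is a minimal illustration, not any product's API; the `KeyValueStore` class and its method names are hypothetical.

```python
import json
from typing import Dict, Optional


class KeyValueStore:
    """In-memory key / value store: byte keys, byte values, three operations."""

    def __init__(self) -> None:
        self._data: Dict[bytes, bytes] = {}

    def put(self, key: bytes, value: bytes) -> None:
        if not isinstance(key, bytes) or not isinstance(value, bytes):
            raise TypeError("keys and values must be bytes")
        self._data[key] = value

    def get(self, key: bytes) -> Optional[bytes]:
        """Return the stored value, or None when the key is absent."""
        return self._data.get(key)

    def delete(self, key: bytes) -> bool:
        """Remove the key; report whether it existed."""
        return self._data.pop(key, None) is not None


store = KeyValueStore()
store.put(b"current_date", b"2011.01.16")
# The store does not interpret values; a JSON object is just serialized bytes.
store.put(b"sergejusb", json.dumps({"name": "Sergejus"}).encode("utf-8"))
```

Because the store never looks inside a value, any structure (binary object, JSON) is the client's responsibility to encode and decode.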

35. Key / value store: Dynamo*

36. Membase

37. Voldemort

38. Redis

39. Azure Table Storage

40. Riak

41. Key / value store. Name: Dynamo; Created: 2007, Amazon (proprietary); Implementation: ?; Distributed: Yes; Replication: Multiple Servers; CAP: AP; API: ?

42. Key / value store. Name: Membase; Created: 2010, sponsored by Zynga; Implementation: C / C++ / Erlang; Distributed: Yes; Replication: Multiple Servers; CAP: CP; API: Memcached API, JSON

43. Key / value store. Name: Voldemort; Created: 2008, LinkedIn; Implementation: Java; Distributed: Yes; Replication: Multiple Servers; CAP: AP; API: Java

44. Key / value store. Name: Redis; Created: 2009, sponsored by VMware; Implementation: C; Distributed: No; Replication: Master / Slave; CAP: CP; API: Various Languages

45. Key / value store. Name: Azure Table Storage; Created: 2008, Microsoft; Implementation: ?; Distributed: Yes; Replication: Multiple Servers (DFS); CAP: CP; API: .NET API, JSON

46. Key / value store. Name: Riak; Created: 2008, Basho (from Akamai); Implementation: Erlang; Distributed: Yes; Replication: Multiple Servers; CAP: AP; API: JSON

47. Document database: Document == complex object (XML, YAML, JSON / BSON); support for secondary indexes

48. Schema can be defined at runtime

49. Optional support for simple querying using Map / Reduce
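A rough sketch of these properties: documents are free-form objects whose attributes may differ from one document to the next, and a secondary index speeds up lookups on a chosen field. The `DocumentCollection` class below is hypothetical and insert-only; it does not mirror the API of any database named in these slides.

```python
from collections import defaultdict


class DocumentCollection:
    """Schema-less documents (plain dicts) with optional secondary indexes."""

    def __init__(self):
        self._docs = {}     # document id -> document
        self._indexes = {}  # field name -> {field value -> set of document ids}

    def create_index(self, field):
        index = defaultdict(set)
        for doc_id, doc in self._docs.items():
            if field in doc:
                index[doc[field]].add(doc_id)
        self._indexes[field] = index

    def insert(self, doc_id, doc):
        if doc_id in self._docs:
            raise ValueError("insert-only sketch: id already exists")
        self._docs[doc_id] = doc
        for field, index in self._indexes.items():
            if field in doc:
                index[doc[field]].add(doc_id)

    def find(self, field, value):
        if field in self._indexes:  # indexed lookup
            ids = self._indexes[field].get(value, set())
            return [self._docs[i] for i in sorted(ids)]
        # No index on this field: fall back to scanning every document.
        return [d for _, d in sorted(self._docs.items()) if d.get(field) == value]


people = DocumentCollection()
people.create_index("city")
people.insert("1", {"name": "sergejus", "city": "Vilnius"})
people.insert("2", {"name": "tdagys", "twitter": "@tdagys"})  # different attributes
```

No schema is declared anywhere: the second document simply carries a field the first one lacks.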

50. Document database: MongoDB

51. CouchDB

52. RavenDB

53. Document database. Name: MongoDB; Created: 2008, 10gen; Implementation: C++; Distributed: Yes, via shards; Replication: Master / Slave; CAP: CP; API: BSON

54. Document database. Name: CouchDB; Created: 2005; Implementation: Erlang; Distributed: Sort of; Replication: Master / Master; CAP: AP; API: JSON

55. Document database. Name: RavenDB; Created: 2010, Ayende Rahien; Implementation: C#; Distributed: Yes, via shards; Replication: Master / Master; CAP: AP; API: .NET API, JSON

56. Graph database: Graph == network

57. Basic constructs: Node, Edge, Properties. Example graph (diagram labels): nodes sergejus, tdagys, sergejus.blogas.lt; edges authors, reads, knows, knows
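The three constructs can be sketched as a small in-memory property graph. The diagram's wiring is not recoverable from the transcript, so which node each edge connects is an assumption made for illustration, as are the property values; the `Graph` class is hypothetical.

```python
class Graph:
    """Property graph: nodes and labeled, directed edges, both with properties."""

    def __init__(self):
        self.nodes = {}  # node id -> properties
        self.edges = []  # (source, label, target, properties)

    def add_node(self, node_id, **properties):
        self.nodes[node_id] = properties

    def add_edge(self, source, label, target, **properties):
        if source not in self.nodes or target not in self.nodes:
            raise KeyError("both endpoints must be existing nodes")
        self.edges.append((source, label, target, properties))

    def neighbors(self, source, label):
        """Follow edges with the given label that start at `source`."""
        return [t for s, l, t, _ in self.edges if s == source and l == label]


g = Graph()
g.add_node("sergejus", kind="person")
g.add_node("tdagys", kind="person")
g.add_node("sergejus.blogas.lt", kind="blog")
# Assumed wiring of the slide's edge labels:
g.add_edge("sergejus", "authors", "sergejus.blogas.lt")
g.add_edge("tdagys", "reads", "sergejus.blogas.lt")
g.add_edge("sergejus", "knows", "tdagys")
g.add_edge("tdagys", "knows", "sergejus")
```

Queries become traversals: `g.neighbors("sergejus", "knows")` walks the edges instead of joining tables.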

58. Graph database: FlockDB

59. Neo4J

60. Graph database. Name: FlockDB; Created: 2010, Twitter; Implementation: Scala; Distributed: Yes; Replication: Multiple Servers; CAP: AP; API: Thrift, Ruby

61. Graph database. Name: Neo4J; Created: 2003, Neo Technology; Implementation: Java; Distributed: No; Replication: Master / Slave; CAP: CP; API: JSON, Various Languages

62. Columnar database: for HUGE amounts of data

63. Columns are added at runtime

64. Great scalability: horizontal, vertical

65. Columnar database: unusual data model. Key Space == Database; Column Family == Table; Columns and Super Columns; Super Column == array of Columns; Column == Tuple<Key, Value, Timestamp, TTL>
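The data model above maps naturally onto nested dictionaries. The sketch below is illustrative only: the `Column` dataclass follows the Tuple<Key, Value, Timestamp, TTL> shape from the slide, while the `write` helper, its newest-timestamp-wins rule, and the sample names are assumptions.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Column:
    key: str
    value: bytes
    timestamp: float
    ttl: Optional[float] = None  # seconds to live; None means "never expires"

    def is_expired(self, now: float) -> bool:
        return self.ttl is not None and now >= self.timestamp + self.ttl


def write(row: dict, column: Column) -> None:
    """Store the column unless the row already holds a newer version (assumption)."""
    current = row.get(column.key)
    if current is None or column.timestamp >= current.timestamp:
        row[column.key] = column


# Key Space -> Column Family -> row key -> columns, all created at runtime.
keyspace = {"Users": {"sergejusb": {}}}
row = keyspace["Users"]["sergejusb"]
write(row, Column("name", b"Sergejus", timestamp=1.0))
write(row, Column("session", b"abc", timestamp=2.0, ttl=30.0))

# A Super Column groups several Columns under one name.
super_column = {"address": [Column("city", b"Vilnius", timestamp=1.0)]}
```

Rows in the same column family need not share columns: adding `session` to one row changes nothing for the others.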

66. Columnar database (diagrams): Simple Column, Super Column

Columnar database: BigTable*

67. Cassandra

68. HBase

69. Hypertable

70. Columnar database. Name: BigTable; Created: 2006, Google; Implementation: C++; Distributed: Yes; Replication: Multiple Servers (GFS); CAP: CP; API: C++

71. Columnar database. Name: Cassandra; Created: 2008, Facebook; Implementation: Java; Distributed: Yes; Replication: Multiple Servers; CAP: AP; API: Thrift, Avro

72. Columnar database. Name: HBase; Created: 2007, Powerset; Implementation: Java; Distributed: Yes; Replication: Multiple Servers (HDFS); CAP: CP; API: Thrift, Java, JSON

73. Columnar database. Name: Hypertable; Created: 2007, Zvents; Implementation: C++; Distributed: Yes; Replication: Multiple Servers; CAP: CP; API: Thrift

74. NoSQL Limitations: ORDER BY? “Natural Key Order”
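"Natural key order" means results come back sorted by key only, so the desired sort order has to be designed into the key itself. The sketch below assumes a store that keeps keys sorted and supports range scans; the `SortedKeyStore` class and the date-prefixed key format are hypothetical.

```python
import bisect


class SortedKeyStore:
    """Keeps keys in sorted order; the only available ordering is by key."""

    def __init__(self):
        self._keys = []    # sorted list of keys
        self._values = {}  # key -> value

    def put(self, key, value):
        if key not in self._values:
            bisect.insort(self._keys, key)
        self._values[key] = value

    def scan(self, start, end):
        """Return (key, value) pairs with start <= key < end, in key order."""
        lo = bisect.bisect_left(self._keys, start)
        hi = bisect.bisect_left(self._keys, end)
        return [(k, self._values[k]) for k in self._keys[lo:hi]]


posts = SortedKeyStore()
# To "ORDER BY date", the date is placed at the front of the key.
posts.put("2011-01-16#nosql", "NoSQL – What's that?")
posts.put("2011-01-15#intro", "Intro")
posts.put("2011-02-01#later", "Later post")
january = posts.scan("2011-01-01", "2011-02-01")
```

Sorting by any other attribute would need a second copy of the data under a different key, which is the limitation the slide points at.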

75. NoSQL Limitations: GROUP BY? Map / Reduce
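A GROUP BY with an aggregate can be expressed as a map step that emits (group key, value) pairs and a reduce step that folds each group. The following single-process sketch only shows the shape of the computation; `map_reduce` is a hypothetical helper, not the API of any database in these slides.

```python
from collections import defaultdict


def map_reduce(records, mapper, reducer):
    """Run mapper over records, group emitted pairs by key, reduce each group."""
    groups = defaultdict(list)
    for record in records:
        for key, value in mapper(record):  # map: emit (key, value) pairs
            groups[key].append(value)      # shuffle: collect values per key
    return {key: reducer(key, values) for key, values in groups.items()}


orders = [
    {"city": "Vilnius", "amount": 10},
    {"city": "Kaunas", "amount": 5},
    {"city": "Vilnius", "amount": 7},
]

# Roughly: SELECT city, SUM(amount) FROM orders GROUP BY city
totals = map_reduce(
    orders,
    mapper=lambda order: [(order["city"], order["amount"])],
    reducer=lambda city, amounts: sum(amounts),
)
```

What SQL states in one clause becomes two user-written functions, which is why the slide lists it as a limitation.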

76. NoSQL Limitations: JOIN? Multiple Map / Reduce
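One common way to join with Map / Reduce is a reduce-side join: records from both inputs are tagged with their source, grouped by the join key, and paired in the reduce step. The single-process sketch below is an assumption about how such a join might look; on a real cluster the tagging, grouping and pairing may run as several chained jobs, and the function and field names are hypothetical.

```python
from collections import defaultdict


def reduce_side_join(users, orders):
    """Inner join of users.id = orders.user_id, returning (name, order_id) pairs."""
    # Map: tag every record with its source and emit it under the join key.
    mapped = [(u["id"], ("user", u)) for u in users]
    mapped += [(o["user_id"], ("order", o)) for o in orders]

    # Shuffle: bring all records with the same join key together.
    groups = defaultdict(list)
    for key, tagged in mapped:
        groups[key].append(tagged)

    # Reduce: pair each user with each of that user's orders.
    joined = []
    for key in sorted(groups):
        left = [rec for tag, rec in groups[key] if tag == "user"]
        right = [rec for tag, rec in groups[key] if tag == "order"]
        for user in left:
            for order in right:
                joined.append((user["name"], order["order_id"]))
    return joined


users = [{"id": 1, "name": "sergejus"}, {"id": 2, "name": "tdagys"}]
orders = [
    {"order_id": 10, "user_id": 1},
    {"order_id": 11, "user_id": 1},
    {"order_id": 12, "user_id": 3},  # no matching user: dropped by the inner join
]
result = reduce_side_join(users, orders)
```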

77. NoSQL Limitations: SELECT *? Multi-Machine Map / Reduce

78. NoSQL Limitations: Maturity

79. Tooling

80. Specificity

81. SQL vs. NoSQL: choose the right tool for the task

82. You can use BOTH

83. Q & A
