Intro to MongoDB

Introduction to MongoDB

20spokes

May 17, 2012

Transcript

  1. MongoDB 101 Ryan Fischer @ryanfischer20 Thursday, May 17, 12

  2. What you will hear: what is NoSQL; available NoSQL databases; intro to MongoDB
  3. What is NoSQL

  4. Fancy Answer: NoSQL is a class of database management systems identified by its non-adherence to the widely used relational database management system model
  5. My Answer: It’s not SQL

  6. Does not use SQL as its query language; may not give full ACID guarantees; distributed architecture; typically optimized for read and write operations
  7. Advantages

  8. Traditional Scaling: Bigger is better! (or so they thought); increase the size and power of the server
  9. Scaling with NoSQL: Scale horizontally! Distribute across multiple servers; more economical, using lower-cost servers
  10. Goodbye Schemas: flexible data models; easy to add/change data structures
  11. Disadvantages

  12. Goodbye Schemas: flexible data structures; data integrity depends on the application
  13. NoSQL is Still Young: does not reduce administration (at least not yet); lack of expertise; lack of projects expanding on NoSQL
  14. NoSQL Databases: Cassandra, MongoDB, CouchDB, Redis, Riak, Neo4j
  15. How is it used?

  16. NoSQL out in the Wild: analytics (takes advantage of read/write optimizations); logging; large-scale projects
  17. MongoDB

  18. What is MongoDB: document-oriented storage; replication and auto-sharding; document-based queries similar to SQL; atomic updates; Map/Reduce
  19. Document Oriented: No schemas!! No joins, for high performance and scalability; embed documents; JSON-style storage
  20. High Performance: stores a lot of data in memory; embedding documents increases read and write performance; allows indexing
  21. Availability and Scalability: replicated servers with automatic master failover; auto-sharding across servers; eventually consistent reads distributed over replicated servers
  22. Atomic Modifiers: update documents in place; does not replace the entire document; ideally suited for write-heavy applications
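
The in-place update idea on slide 22 can be illustrated with a small, self-contained JavaScript simulation. `applyModifiers` is a hypothetical helper, not a MongoDB API; it only imitates how the `$inc` and `$set` modifiers change the named fields instead of replacing the whole document. Against a real server the same modifier document would be passed to an update call, which is not shown here.

```javascript
// Hypothetical helper: applies $set and $inc modifiers to a plain object in place.
// It only imitates the semantics; a real server applies them atomically per document.
function applyModifiers(doc, update) {
  for (const [field, value] of Object.entries(update.$set || {})) {
    doc[field] = value;
  }
  for (const [field, amount] of Object.entries(update.$inc || {})) {
    doc[field] = (doc[field] || 0) + amount; // a missing field starts from 0
  }
  return doc;
}

const page = { _id: 1, title: "Intro", views: 10 };
applyModifiers(page, { $inc: { views: 1 }, $set: { title: "Intro to MongoDB" } });
console.log(page);
```

Only `views` and `title` change; `_id` and any other fields are left untouched, which is the point of a modifier compared with rewriting the full document.
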
  23. Storing data: data is grouped by collections; a collection contains documents of key-value pairs; values can be rich, including arrays and documents; stored as BSON (Binary JSON, a binary serialization of JSON-like documents)
  24. Querying: JavaScript console allows for functions; returns a cursor (lazy loading of results); queries expressed as JSON; documents are auto-assigned an ObjectId
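
To make "queries expressed as JSON" and "returns a cursor" concrete, here is a minimal sketch in plain JavaScript that runs without a database. Both `matches` and `find` are hypothetical stand-ins written for this example; the matcher understands only exact equality plus the `$gt` and `$lt` operators, and a generator plays the role of the lazy cursor.

```javascript
// Hypothetical matcher: supports exact equality and the $gt / $lt operators only.
function matches(doc, query) {
  return Object.entries(query).every(([field, cond]) => {
    if (cond !== null && typeof cond === "object") {
      if ("$gt" in cond && !(doc[field] > cond.$gt)) return false;
      if ("$lt" in cond && !(doc[field] < cond.$lt)) return false;
      return true;
    }
    return doc[field] === cond;
  });
}

// A generator stands in for the cursor: documents are tested only when requested.
function* find(collection, query) {
  for (const doc of collection) {
    if (matches(doc, query)) yield doc;
  }
}

const users = [
  { name: "Ann", age: 31 },
  { name: "Bob", age: 19 },
  { name: "Cy", age: 45 },
];

const cursor = find(users, { age: { $gt: 21 } });
const first = cursor.next().value; // only now is the first match computed
console.log(first.name);
```

Nothing is scanned when `find` is called; each `next()` pulls one more matching document, which mirrors the lazy loading the slide describes.
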
  25. Examples: Interactive time! Go to https://gist.github.com/2719591 for examples
  26. Embed vs Referenced: relationships for models; object models (think differently); when in doubt, store in a different collection
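
The two modeling choices on slide 26 can be compared side by side with plain JavaScript objects. The post/comment data, the field names, and the `commentsFor` helper are all invented for this sketch; arrays stand in for collections.

```javascript
// Embedded: comments live inside the post, so one read returns everything.
const embeddedPost = {
  _id: 1,
  title: "MongoDB 101",
  comments: [
    { author: "ann", text: "Nice talk" },
    { author: "bob", text: "Thanks" },
  ],
};

// Referenced: comments sit in their own collection and point back to the post.
const posts = [{ _id: 1, title: "MongoDB 101" }];
const comments = [
  { _id: 10, postId: 1, author: "ann", text: "Nice talk" },
  { _id: 11, postId: 1, author: "bob", text: "Thanks" },
  { _id: 12, postId: 2, author: "cy", text: "Other post" },
];

// With references the application does the "join" itself, as a second lookup.
function commentsFor(postId) {
  return comments.filter((c) => c.postId === postId);
}

console.log(embeddedPost.comments.length, commentsFor(posts[0]._id).length);
```

Embedding favors reads that always need the whole object; a separate collection keeps documents small and lets comments be queried on their own, at the cost of the extra lookup.
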
  27. Geospatial Queries

  28. SQL: SELECT * FROM Places WHERE acos(sin(1.3963) * sin(Lat) + cos(1.3963) * cos(Lat) * cos(Lon - (-0.6981))) * 6371 <= 1000;
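
The SQL on slide 28 is the spherical law of cosines: with latitudes and longitudes in radians, the `acos(...)` term is the central angle between two points, and multiplying by 6371 (Earth's mean radius in km) turns it into kilometers. The sketch below restates that formula in JavaScript. The function names are assumptions made for this example, and the clamp before `acos` is an addition that the SQL does not have.

```javascript
const EARTH_RADIUS_KM = 6371; // same constant as the SQL on the slide

// Great-circle distance via the spherical law of cosines; all angles in radians.
function distanceKm(lat1, lon1, lat2, lon2) {
  const cosAngle =
    Math.sin(lat1) * Math.sin(lat2) +
    Math.cos(lat1) * Math.cos(lat2) * Math.cos(lon2 - lon1);
  // Clamp to [-1, 1] so rounding error cannot push acos out of its domain.
  return Math.acos(Math.min(1, Math.max(-1, cosAngle))) * EARTH_RADIUS_KM;
}

// Same filter as the SQL: keep places within 1000 km of (lat 1.3963, lon -0.6981).
function within1000Km(place) {
  return distanceKm(1.3963, -0.6981, place.lat, place.lon) <= 1000;
}

console.log(within1000Km({ lat: 1.3963, lon: -0.6981 })); // the reference point itself
console.log(within1000Km({ lat: 0, lon: 0 })); // a point on the equator
```

Written as SQL, this expression has to be evaluated for every row, which is the contrast the deck draws with an indexed geospatial query.
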
  29. Ways to search: exact queries; search by closest points
  30. Bound Queries: query within a rectangle; circle with a center point and radius; search within a polygon (>= 1.9)
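
The three bounded shapes on slide 30 come down to simple containment tests on flat coordinates. The sketch below shows that geometry in plain JavaScript; it does not call MongoDB, and the function names are assumptions for illustration. In the MongoDB shell of that era these shapes were, to my knowledge, expressed with `$within` together with `$box`, `$center`, and `$polygon`.

```javascript
// Points are [x, y] pairs on a flat plane.

// Rectangle given by its bottom-left and top-right corners.
function inBox([x, y], [[minX, minY], [maxX, maxY]]) {
  return x >= minX && x <= maxX && y >= minY && y <= maxY;
}

// Circle given by a center point and a radius, in the same flat units.
function inCircle([x, y], [cx, cy], radius) {
  return Math.hypot(x - cx, y - cy) <= radius;
}

// Polygon given as a list of vertices; ray casting counts edge crossings.
function inPolygon([x, y], vertices) {
  let inside = false;
  for (let i = 0, j = vertices.length - 1; i < vertices.length; j = i++) {
    const [xi, yi] = vertices[i];
    const [xj, yj] = vertices[j];
    const crosses =
      (yi > y) !== (yj > y) && x < ((xj - xi) * (y - yi)) / (yj - yi) + xi;
    if (crosses) inside = !inside;
  }
  return inside;
}

const triangle = [[0, 0], [4, 0], [0, 4]];
console.log(inBox([1, 1], [[0, 0], [3, 3]]), inCircle([3, 4], [0, 0], 5), inPolygon([1, 1], triangle));
```
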
  33. Spherical Model: use decimal degrees (e.g. 42.53); use [longitude, latitude] ordering; use radians for distance; $nearSphere and $centerSphere
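
The rules on slide 33 are easy to get wrong, so here is a small sketch that only builds a query document and never contacts a server. The field name `loc`, the helper names, and the sample coordinates are assumptions; the query shape follows the legacy coordinate-pair form of `$nearSphere` with `$maxDistance` given in radians, as I understand it.

```javascript
// Convert a distance in kilometers to radians of arc on a sphere of Earth's mean radius.
function kmToRadians(km) {
  const earthRadiusKm = 6371;
  return km / earthRadiusKm;
}

// Build a legacy-style query document; coordinates are [longitude, latitude] in decimal degrees.
function nearSphereQuery(longitude, latitude, maxKm) {
  return {
    loc: { $nearSphere: [longitude, latitude], $maxDistance: kmToRadians(maxKm) },
  };
}

const query = nearSphereQuery(-71.06, 42.53, 100);
console.log(JSON.stringify(query));
```

Note the ordering: longitude comes first, and the 100 km limit is converted to radians before it goes into the query.
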
  34. GridFS: store large files in MongoDB; files are stored in chunks
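
The chunking idea behind GridFS can be sketched without a database: split the bytes into fixed-size pieces, tag each piece with a sequence number, and reassemble by sorting on that number. The helpers below are hypothetical, the chunk documents are modeled loosely on a `files_id` / `n` / `data` layout, and the 4-byte chunk size is chosen only to keep the demo small.

```javascript
// Split a byte buffer into fixed-size chunks, each tagged with its position.
function toChunks(filesId, bytes, chunkSize) {
  const chunks = [];
  for (let offset = 0, n = 0; offset < bytes.length; offset += chunkSize, n++) {
    chunks.push({ files_id: filesId, n, data: bytes.subarray(offset, offset + chunkSize) });
  }
  return chunks;
}

// Reassemble by sorting on the sequence number and concatenating.
function fromChunks(chunks) {
  const sorted = [...chunks].sort((a, b) => a.n - b.n);
  const total = sorted.reduce((sum, c) => sum + c.data.length, 0);
  const out = new Uint8Array(total);
  let offset = 0;
  for (const c of sorted) {
    out.set(c.data, offset);
    offset += c.data.length;
  }
  return out;
}

const file = Uint8Array.from({ length: 10 }, (_, i) => i);
const chunks = toChunks("file-1", file, 4); // tiny chunk size just for the demo
console.log(chunks.length);
```

Because every chunk is an ordinary document, a large file never has to fit in a single document, and a reader can fetch only the chunks it needs.
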
  35. What is Sharding: allows MongoDB to scale horizontally; evenly distributes chunks of data; performed per collection
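
A rough way to picture slide 35: a collection is split into chunks, each chunk covers a range of shard-key values, and each chunk lives on one shard. The routing table, shard names, and `shardFor` helper below are made up for illustration and say nothing about how a real cluster balances or splits chunks.

```javascript
// Each chunk owns a half-open range [min, max) of shard-key values and lives on one shard.
// Shard names and ranges are made up for illustration.
const chunkTable = [
  { min: -Infinity, max: 1000, shard: "shardA" },
  { min: 1000, max: 2000, shard: "shardB" },
  { min: 2000, max: Infinity, shard: "shardC" },
];

// Route a document to the shard whose chunk range contains its shard-key value.
function shardFor(doc, shardKey) {
  const value = doc[shardKey];
  const chunk = chunkTable.find((c) => value >= c.min && value < c.max);
  return chunk.shard;
}

console.log(shardFor({ userId: 1500 }, "userId"));
```

Scaling out then means adding a shard and moving some chunks to it, rather than buying a bigger server.
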
  37. Disadvantages: no inherent transaction support; scaling sometimes isn’t simple; multiple servers recommended; object modeling can be complex
  38. Advantages: active community, including 10gen; driver support for most languages; many new features to come
  39. The big data loss debate: Internet flame war history; Mongo performs one write at a time (global lock); stored in memory; replication (failover)
  40. The End: Follow me - @ryanfischer20
