Slide 1

Slide 1 text

Redis vs. MongoDB: Comparing In-Memory Databases with Percona Memory Engine Read the Blog Post | Vaibhaw Pandey

Slide 2

Slide 2 text

v Redis Redis is a popular and very fast in-memory database structure store primarily used as a cache or a message broker. Being in-memory, it’s the data store of choice when response times trump everything else. MongoDB MongoDB is an on-disk document store that provides a JSON interface to data and has a very rich query language. Known for its speed, efficiency, and scalability, it’s currently the most popular NoSQL database used today. However, being an on-disk database, it can’t compare favorably to an in-memory database like Redis in terms of absolute performance. But, with the availability of the in-memory storage engines for MongoDB, a more direct comparison becomes feasible. Read the Redis vs. MongoDB Blog Database-as-a-Service |

Slide 3

Slide 3 text

Percona Memory Engine for MongoDB Starting in version 3.0, MongoDB provides an API to plug in the storage engine of your choice. A storage engine, from the MongoDB context, is the component of the database that’s responsible for managing how the data is stored, both in-memory and on-disk. MongoDB supports an in-memory storage engine, however, it’s limited to their Enterprise product. In 2016, Percona released an open source in-memory engine for the MongoDB Community Edition called the Percona Memory Engine for MongoDB. Like MonogDB’s in-memory engine, it’s also a variation of the WiredTiger storage engine, but with no persistence to disk. With an in-memory MongoDB storage engine in place, we have a level playing field between Redis and MongoDB. So, why do we need to compare the two? Let’s look at the advantages of each of them as a caching solution. Read the Redis vs. MongoDB Blog Database-as-a-Service |

Slide 4

Slide 4 text

Let’s take a look at Redis Read the Redis vs. MongoDB Blog Database-as-a-Service |

Slide 5

Slide 5 text

Redis Advantages A well-known caching solution that excels at it. Redis isn’t a plain cache solution – it has an advanced data structures that provide many powerful ways to save and query data that can’t be achieved with a vanilla key-value cache. Redis is fairly simple to setup, use and learn. Redis provides persistence that you can opt to set up, so cache warming in the event of a crash is hassle free. Redis Disadvantages It doesn’t have inbuilt encryption on the wire. No role-based account control (RBAC). There isn’t a seamless, mature clustering solution. Can be a pain to deploy in large-scale cloud deployments. ✔ ✔ ✔ ✔ ✔ ✔ ✔ ✔ Red a C c Read the Redis vs. MongoDB Blog Database-as-a-Service |

Slide 6

Slide 6 text

Let’s take a look at MongoDB Read the Redis vs. MongoDB Blog Database-as-a-Service |

Slide 7

Slide 7 text

MongoDB Advantages Traditional database with advanced data manipulation features (think aggregations and map-reduce) and a rich query language. SSL, RBAC, and scale-out built in. Operational and development costs drop for current MongoDB users as you only have one database to learn and manage. With an in-memory engine, it offers no persistence until it’s deployed as a replica set with persistence configured on the read replica(s). ✔ ✔ ✔ ✔ Mon a C c Look at this post from Peter Zaitsev on where the MongoDB in-memory engine might be a good fit. Read the Redis vs. MongoDB Blog Database-as-a-Service | MongoDB Disadvantages

Slide 8

Slide 8 text

Redis vs. In-Memory MongoDB Read the Redis vs. MongoDB Blog Database-as-a-Service |

Slide 9

Slide 9 text

Performance: Redis vs. MongoDB Redis performs considerably better for reads for all sorts of workloads, and for writes as the workloads increase. Even though MongoDB utilizes all the cores of the system, it gets CPU bound comparatively early. While it still had compute available, it was better at writes than Redis. Both databases are eventually compute-bound. Even though Redis is single-threaded, it (mostly) gets more done with running on one core than MongoDB does while saturating all the cores. Redis, for non-trivial data sets, uses a lot more RAM compared to MongoDB to store the same amount of data. ✔ ✔ ✔ ✔ Now, le ’s o t e c gu on r u s Read the Redis vs. MongoDB Blog Database-as-a-Service |

Slide 10

Slide 10 text

Configuration: Redis vs. MongoDB Database instance type: AWS EC2 c4.xlarge featuring 4 cores, 7.5 GB memory, and enhanced networking to ensure we don’t have any network bottlenecks. Client Machine: AWS EC2 c4.xlarge in the same virtual private cloud (VPC) as the database servers. Redis: Version 3.2.8 with AOF and RDB turned off. Standalone. MongoDB: Percona Memory Engine based on MongoDB version 3.2.12. Standalone. ✔ ✔ ✔ ✔ We used YCSB to measure the performance. Learn more about YCSB in our How to Benchmark MongoDB with YCSB post to see how we compare and benchmark database performance with various cloud providers and configurations.. We assume a basic understanding of YCSB workloads and features in the test rig description. Read the Redis vs. MongoDB Blog Database-as-a-Service |

Slide 11

Slide 11 text

Configuration: Redis vs. MongoDB (2) Network Throughput: Measured via iperf as recommended by AWS: Workload Details 1. Insert Workload: 100% Writes – 2.5 million records 2. Workload A: Update heavy workload – 50%/50% Reads/Writes – 25 million operations 3. Workload B: Read mostly workload – 95%/5% Reads/Writes – 25 million operations Client Load: Throughput and latency measured over incrementally increasing loads generated from the client. This was done by increasing the number of YCSB client load threads, starting at 8 and growing in multiples of 2 ✔ ✔ ✔ [ ID] Interval Transfer Bandwidth Retr [ 4] 0.00-60.00 sec 8.99 GBytes 1.29 Gbits/sec 146 sender [ 4] 0.00-60.00 sec 8.99 GBytes 1.29 Gbits/sec receiver Read the Redis vs. MongoDB Blog Database-as-a-Service |

Slide 12

Slide 12 text

Results: Insert Workload Performance ✔ 100% Writes ✔ 2.5M Throughput/Latency The number of records was selected to ensure the total memory was used in the event Redis that did not exceed 80% (since Redis is the memory hog, see Appendix B). Read the Redis vs. MongoDB Blog Database-as-a-Service |

Slide 13

Slide 13 text

Insert Workload Observations For MongoDB, CPU was saturated by 32 threads onwards. Greater than 300% usage with single-digit idle percentages. For Redis, CPU utilization never crossed 95%. ✔ ✔ Read the Redis vs. MongoDB Blog Database-as-a-Service |

Slide 14

Slide 14 text

Results: Workload A Performance ✔ 50/50% Read/Update ✔ 25M Throughput/Latency Read the Redis vs. MongoDB Blog Database-as-a-Service |

Slide 15

Slide 15 text

Workload A Observations For MongoDB, CPU was saturated by 32 threads onwards. Greater than 300% usage with single-digit idle percentages. For Redis, CPU utilization never crossed 95%. For Redis, by 64 threads and above, runs failed often with read-timeout exceptions. ✔ ✔ ✔ Read the Redis vs. MongoDB Blog Database-as-a-Service |

Slide 16

Slide 16 text

Results: Workload B Performance ✔ 95/5% Read/Update ✔ 25M Throughput/Latency ✔ Cache-reading workload Read the Redis vs. MongoDB Blog Database-as-a-Service |

Slide 17

Slide 17 text

Workload B Observations For MongoDB, CPU was saturated by 32 threads onwards. Greater than 300% usage with single-digit idle percentages. For Redis, CPU utilization never crossed 95%. So, Redis was consistently performing better than MongoDB while running on a single thread, while MongoDB was saturating all the cores of the machine. For Redis, at 128 threads, runs failed often with read-timeout exceptions. ✔ ✔ ✔ Read the Redis vs. MongoDB Blog Database-as-a-Service |

Slide 18

Slide 18 text

Appendices Read the Redis vs. MongoDB Blog Database-as-a-Service |

Slide 19

Slide 19 text

Appendice A: Single-Thread Performance Even though it’s not very useful in real-world conditions, who would be better when applying the same load to each of them from a single thread. That is, how would a single-threaded application perform? (ops/sec) Insert 2.5M Workload A 25M Workload B 25M MongoDB Percona Throughput 2998.910796 3420.613713 3551.311407 Redis Throughput 2196.184262 4376.60512 4286.422158 Read the Redis vs. MongoDB Blog Database-as-a-Service |

Slide 20

Slide 20 text

Appendice B: Database Size The default format of records inserted by YCSB are: each record is of 10 fields and each field is 100 bytes. Assuming each record to be around 1KB, the total expected size in memory would be upwards of 2.4GB. There was a stark contrast in the actual sizes as seen in the databases. Read the Redis vs. MongoDB Blog Database-as-a-Service |

Slide 21

Slide 21 text

MongoDB > db.usertable.count() 2500000 > db.usertable.findOne() { "_id" : "user6284781860667377211", "field1" : BinData(0,"OUlxLllnPC0sJEovLTpyL18jNjk6ME8vKzF4Kzt2OUEzMSEwMkBvPytyODZ4Plk7KzRmK0FzOiYoNFU1O185KFB/IVF7LykmPkE9NF1pLDFoNih0KiIwJU89K0ElMSAgKCF+Lg=="), "field0" : BinData(0,"ODlwIzg0Ll5vK1s7NUV1O0htOVRnMk53JEd3KiwuOFN7Mj5oJ1FpM11nJ1hjK0BvOExhK1Y/LjEiJDByM14zPDtgPlcpKVYzI1kvKEc5PyY6OFs9PUMpLEltNEI/OUgzIFcnNQ=="), "field7" : BinData(0,"N155M1ZxPSh4O1B7IFUzJFNzNEB1OiAsM0J/NiMoIj9sP1Y1Kz9mKkJ/OiQsMSk2OCouKU1jOltrMj4iKEUzNCVqIV4lJC0qIFY3MUo9MFQrLUJrITdqOjJ6NVs9LVcjNExxIg=="), "field6" : BinData(0,"Njw6JVQnMyVmOiZyPFxrPz08IU1vO1JpIyZ0I1txPC9uN155Ij5iPi5oJSIsKVFhP0JxM1svMkphL0VlNzdsOlQxKUQnJF4xPkk9PUczNiF8MzdkNy9sLjg6NCNwIy1sKTw6MA=="), "field9" : BinData(0,"KDRqP1o3KzwgNUlzPjwgJEgtJC44PUUlPkknKU5pLzkuLEAtIlg9JFwpKzBqIzo2MCIoKTxgNU9tIz84OFB/MzJ4PjwoPCYyNj9mOjY+KU09JUk1I0l9O0s/IEUhNU05NShiNg=="), "field8" : BinData(0,"NDFiOj9mJyY6KTskO0A/OVg/NkchKEFtJUprIlJrPjYsKT98JyI8KFwzOEE7ICR4LUF9JkU1KyRkKikoK0g3MEMxKChsL10pKkAvPFRxLkxhOlotJFZlM0N/LiR4PjlqJ0FtOw=="), "field3" : BinData(0,"OSYoJTR+JEp9K00pKj0iITVuIzVqPkBpJFN9Myk4PDhqOjVuP1YhPSM2MFp/Kz14PTF4Mlk3PkhzKlx3L0xtKjkqPCY4JF0vIic6LEx7PVBzI0U9KEM1KDV4NiEuKFx5MiZyPw=="), "field2" : BinData(0,"Njd8LywkPlg9IFl7KlE5LV83ISskPVQpNDYgMEprOkprMy06LlotMUF5LDZ0IldzLl0tJVkjMTdgJkNxITFsNismLDxuIyYoNDgsLTc+OVpzKkBlMDtoLyBgLlctLCxsKzl+Mw=="), "field5" : BinData(0,"OCJiNlI1O0djK1BtIyc4LEQzNj9wPyQiPT8iNE1pODI2LShqNDg4JF1jNiZiNjZuNE5lNzA8OCAgMDp2OVkjNVU3MzIuJTgkNDp0IyVkJyk6IEEvKzVyK1s9PEAhKUJvPDxyOw=="), "field4" : BinData(0,"OFN1I0B7N1knNSR2LFp7PjUyPlJjP15jIUdlN0AhNEkhMC9+Lkd5P10jO1B3K10/I0orIUI1NzYuME81I0Y1NSYkMCxyI0w/LTc8PCEgJUZvMiQiIkIhPCF4LyN6K14rIUJlJg==") } > db.runCommand({ dbStats: 1, scale: 1 }) { "db" : "ycsb", "collections" : 1, "objects" : 2500000, "avgObjSize" : 1167.8795252, "dataSize" : 2919698813, "storageSize" : 2919698813, "numExtents" : 0, "indexes" : 1, "indexSize" : 76717901, "ok" : 1 } So, the space taken is ~2.7GB which is pretty close to what we expected. Read the Redis vs. MongoDB Blog Database-as-a-Service |

Slide 22

Slide 22 text

Redis > info keyspace # Keyspace db0:keys=2500001,expires=0,avg_ttl=0 127.0.0.1:6379> RANDOMKEY "user3176318471616059981" 127.0.0.1:6379> hgetall user3176318471616059981 1) "field1" 2) "#K/:p:O=?<:(;v/)0)Yw.W!8]+4B=8.z+*4!" 3) "field2" 4) "(9<9P5**d7h=>p\"X9:Qo#C9:;z.Xs=Wy*H3/Fe&0`8)t.Ku0Q3)E#;Sy*C).Sg++t4@7-" 5) "field5" 6) "#1 %8x='l?5d38~&U!+/b./b;(6-:v!5h.Ou2R}./(*)4!8>\"B'!I)5U?0\" >Ro.Ru=849Im+Qm/Ai(;:$Z',]q:($%&(=3~5(~?" 7) "field0" 8) "+\"(1Pw.>*=807Jc?Y-5Nq#Aw=%*57r7!*=Tm!-X}/X#.U) )f9-~;?p4;p*$< D-1_s!0p>" 9) "field7" 10) ":]o/2p/3&(!b> |#:0>#0-9b>Pe6[}.|33$ Vo*M%=\"<$&j%/<5]%\".h&Kc'5.46x5D35'0-3l:\"| !l;" 13) "field6" 14) "-5x6!22)j;O=?1&!:&.S=$;|//r'?d!W54(j!$:-H5.*n&Zc!0f;Vu2Cc?E{1)r?M'!Kg'-b0.B#1" 15) "field9" 16) "(Xa&1t&Xq\"$((Ra/Q9&\": &>4Ua;Q=!T;(Vi2G+)Uu.+|:Ne;Ry3U\x7f!B\x7f>O7!Dc;V7?Eu7E9\"&<-Vi>7\"$Q%%A%1<2/V11: :^c+" 17) "field8" 18) "78(8L9.H#5N+.E5=2`=C-/Ka7<$;6r#_u F9)G/?;t& x?D%=Ba Zk+]) ($=I%3P3$<`>?*=*r9M1-Ye:S%%0,(Ns3,0'A\x7f&Y12A/5" 127.0.0.1:6379> info memory # Memory used_memory:6137961456 used_memory_human:5.72G used_memory_rss:6275940352 used_memory_rss_human:5.84G used_memory_peak:6145349904 used_memory_peak_human:5.72G total_system_memory:7844429824 total_system_memory_human:7.31G used_memory_lua:37888 used_memory_lua_human:37.00K maxmemory:7516192768 maxmemory_human:7.00G maxmemory_policy:noeviction mem_fragmentation_ratio:1.02 mem_allocator:jemalloc-3.6.0 At peak usage, Redis seems to take around 5.72G of memory, 2X as much as MongoDB (comparison may not be perfect due to differences in the databases, but still too large to ignore). YCSB inserts record in a hash in Redis, and an index is maintained in a sorted set. Since an individual entry is larger than 64, the hash is encoded normally and there is no savings in space. Redis performance comes at the price of increased memory footprint. MongoDB might be preferable of you want to reduce your memory costs. Read the Redis vs. MongoDB Blog Database-as-a-Service |

Slide 23

Slide 23 text

Appendice C: Network Throughput An in-memory database server is liable to be either compute-bound or network I/O-bound, so it was important throughout the entire set of these tests to ensure that we were never getting network-bound. Measuring network throughput while running application throughput tests adversely affects the overall throughput measurement. So, we ran subsequent network throughput measurements using iftop at the thread counts where the highest write throughputs were observed. This was found to be around 440 Mbps for both Redis and MongoDB at their respective peak throughput. Given our initial measurement of the maximum network bandwidth to be around 1.29 Gbps, we are certain that we never hit the network bounds. In fact, it only supports the inference that if Redis were multi-core, we might get much better numbers. Read the Redis vs. MongoDB Blog Database-as-a-Service |

Slide 24

Slide 24 text

Tha s r a g Redis vs. MongoDB: Comparing In-Memory Databases with Percona Memory Engine You may also like: Cassandra vs. MongoDB Which is the Best MongoDB GUI? Fast Paging with MongoDB Using Redis with Node.js & Socket.IO ScaleGrid’s fully managed Database-as-a-Service (DBaaS) solution is used by thousands of developers, startups and enterprise customers, including UPS, Dell and Adobe. Easily manage MongoDB and Redis in the cloud with AWS, Azure, and DigitalOcean, and automate your database operations at any scale so you can focus on product. scalegrid.io | help.scalegrid.io | @scalegridio