We live in tumultuous but interesting times. The rich have gotten richer, the poor have gotten poorer, and innovators have devised new ways to work through the disruption caused by the coronavirus pandemic. The pandemic has also changed how we live: many of us learned to cook complex dishes from scratch, others picked up new hobbies, and some took the time to learn something new about themselves. Many of us also finally found the time to curl up on the couch and binge-watch Netflix originals until we ran out of bandwidth.

Sudden surges

Streaming services such as Netflix and Amazon Prime Video run highly scalable systems that can withstand sudden surges and spikes in usage. Even so, they can still experience outages, which frustrate users and, when an outage drags on, can lead users to abandon the platform. Complex, large-scale distributed systems like these, with potentially millions of users, must be tested extensively with surges and spikes in mind.

However, spikes as heavy as those caused by the pandemic were unprecedented and were probably not part of any company's test plans.

Continuous integration, delivery and production

The problems of CI/CD and resolving the problems of constantly engaged systems

Companies like Netflix update their systems constantly, and those updates are continuously tested and delivered to the live platform. To support this, Netflix testing teams create hundreds of thousands of tester accounts every day, each used in thousands of test scenarios, to avoid any shortfalls.

As a result, testing at Netflix has moved from a low-volume manual regimen, run on a test system before a release went live, to continuous, fully automated, high-volume testing of Netflix client and server applications running at scale in production, where nothing is left to chance.

An imaginary scenario with real implications

Imagine this: you and millions of others are at the nail-biting, suspenseful climax of a story and suddenly, boom! Netflix is offline. This would set alarm bells ringing at Netflix HQ, and testing SWAT teams would fly in through your windows to analyse what went wrong. Thankfully, this does not happen often.

The Goal

The goal at Netflix is simple: to be online for their users 99.99% of the time, which leaves room for roughly 53 minutes of downtime in a year. Although Netflix has a decent track record of staying online, it does occasionally encounter glitches that throw the system off track. One such incident occurred when a development team deployed software that damaged Netflix's large infrastructure, causing widespread disruption in services and thousands of unhappy customers.

Netflix scrambled to create a fix that resolved the issue within a few hours. The incident also gave Netflix some food for thought: their testing regimen was inadequate and ineffective for such a large, distributed, user-facing system.

What could go wrong?

What happened at Netflix was an oversight on several levels. A new piece of code designed to clean up unused resources was being tested on the production servers. Two bugs in that code caused major problems:

  1. The first bug caused the dry run mode flag, which was meant to guard the actual cleanup, to be interpreted incorrectly, reversing its effect. A poorly written unit test let this through; a better test could have caught the issue in development.
  2. The second bug was in the code that checked whether a resource was actually unused. The check overlooked some cases that existed only in production.

Together, these two bugs removed key resources in production, resulting in the outage at Netflix.
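
To make the first bug concrete, here is a minimal sketch of a cleanup routine guarded by a dry run flag, with a check that exercises both values of the flag. The names `FakeCloud`, `cleanup` and `check_dry_run_flag` are hypothetical and are not taken from Netflix's code; the point is only that a test covering both modes would fail immediately if the condition were inverted.

```python
from dataclasses import dataclass, field


@dataclass
class FakeCloud:
    """Hypothetical in-memory stand-in for a resource API."""
    resources: set = field(default_factory=set)
    in_use: set = field(default_factory=set)

    def delete(self, name: str) -> None:
        self.resources.discard(name)


def cleanup(cloud: FakeCloud, dry_run: bool = True) -> list:
    """Return the resources considered unused; delete them only when dry_run is False."""
    candidates = sorted(cloud.resources - cloud.in_use)
    if not dry_run:  # writing `if dry_run:` here would reverse the flag's effect
        for name in candidates:
            cloud.delete(name)
    return candidates


def check_dry_run_flag() -> None:
    """Exercise both values of the flag so an inverted condition fails loudly."""
    cloud = FakeCloud(resources={"a", "b", "c"}, in_use={"a"})
    assert cleanup(cloud, dry_run=True) == ["b", "c"]
    assert cloud.resources == {"a", "b", "c"}  # a dry run must not delete anything
    assert cleanup(cloud, dry_run=False) == ["b", "c"]
    assert cloud.resources == {"a"}  # a real run removes only unused resources


check_dry_run_flag()
```

Note that this sketch does nothing about the second bug: if `in_use` misses cases that exist only in production, the test still passes while the real cleanup removes live resources.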

Preventing these problems

Preventing or reducing such incidents leads to a common dilemma:

Should testing be done in a test environment or in a production environment? Most of us would advocate testing in pre-production so that actual customers are not impacted, while some would advocate testing in production to ensure that the code runs well where it matters. In reality, code should be tested in all three environments: dev, test and prod. The challenge for Netflix was to devise an effective methodology for deciding why, when and how to test in each of them.

This also led to another set of questions:

  • Is the test environment a safe and complete mirror of our production environment?

OR

  • Is the test environment the latest build with features that others might need to integrate with?

The common result is a set of overly complex and numerous test environments.

The answer

The answer that emerged from working on a fix was simple: end-to-end automation that could reliably replicate thousands of scenarios.

This answer, however, came with its own problems. The first was finding a scalable way to create a production-like pre-production environment without cloning production entirely, which would require a massive investment.

Another problem was that usage patterns in pre-production could be completely different from those in production, and pre-production traffic is thousands of times lower.

Testing payments

Testing payments was another colossus altogether. Instead of testing payments in production with real money, it is better to create fake methods of payment (MOPs) and exercise fake transactions on them in sandbox accounts, which does not overburden the payment systems already in place.
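
As a rough illustration of the idea, the sketch below models a fake MOP and an in-memory sandbox gateway. `FakeMOP`, `SandboxGateway` and the transaction fields are assumptions made for this example, not a real payment API; a real sandbox would be provided by the payment processor.

```python
from dataclasses import dataclass, field


@dataclass
class FakeMOP:
    """Hypothetical fake method of payment; never maps to a real card or account."""
    token: str
    should_decline: bool = False


@dataclass
class SandboxGateway:
    """Hypothetical in-memory gateway used only by tester accounts."""
    ledger: list = field(default_factory=list)

    def charge(self, mop: FakeMOP, amount_cents: int) -> dict:
        status = "declined" if mop.should_decline else "approved"
        txn = {
            "id": len(self.ledger) + 1,
            "token": mop.token,
            "amount_cents": amount_cents,
            "status": status,
        }
        self.ledger.append(txn)  # recorded locally; no real processor is contacted
        return txn


gateway = SandboxGateway()
ok = gateway.charge(FakeMOP(token="test-visa"), 999)
bad = gateway.charge(FakeMOP(token="test-expired", should_decline=True), 999)
```

Being able to force a decline on demand is the main benefit here: failure paths that are hard to trigger with real money become ordinary test cases.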

The approach

Of the many possible approaches, Netflix chose production capture and replay to bring their tests as close as possible to prod.

A large number of requests from customer devices were taken from persistence, stripped of personally identifiable information, and then duplex-replayed in test. This turned the tests into real-world scenarios and helped identify numerous previously unknown corner-case bugs.

The bugs identified were routed back into functional and integration tests via a schema. This increased confidence in the quality of feature migrations and helped accelerate change velocity. It also led to an interesting learning:

All the basic duplex tests could be run in production through tester accounts. However, prod capture and replay duplex tests were limited to the test environment, because reissuing requests in production would harm actual customer data.
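
The scrub-then-replay step can be sketched as follows. This is an illustration only: the field names in `PII_FIELDS`, the `scrub` and `replay` helpers, and the shape of the captured requests are all assumptions, and a real pipeline would take its list of sensitive fields from a data classification scheme rather than a hard-coded set.

```python
# Assumed field names, for illustration only.
PII_FIELDS = {"email", "name", "ip_address", "payment_token"}


def scrub(value):
    """Return a copy of a captured request with PII fields masked, recursively."""
    if isinstance(value, dict):
        return {
            key: "<redacted>" if key in PII_FIELDS else scrub(item)
            for key, item in value.items()
        }
    if isinstance(value, list):
        return [scrub(item) for item in value]
    return value


def replay(captured: list, handler) -> list:
    """Replay scrubbed requests against a test-environment handler and collect responses."""
    return [handler(scrub(request)) for request in captured]


captured = [
    {"path": "/play", "user": {"id": 7, "email": "someone@example.com"}},
    {"path": "/search", "query": "documentaries", "ip_address": "203.0.113.9"},
]
# The handler here simply echoes the request; in practice it would call the service under test.
responses = replay(captured, handler=lambda request: request)
```

Because `scrub` builds new dictionaries and lists instead of modifying its input, the captured originals stay untouched and can be replayed again later.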

Hastings says. “And instead tragically it is a biological one, so everybody is locked up and we had the greatest growth in the first half of this year that we ever had.” With a market capitalization of around US$230 billion, it has been vying with Walt Disney since March for the title of the world’s most valuable entertainment group.

Masked and refreshed data could safely be used to replay requests in the test environment after a time delay. This shifted the focus to the data set rather than the production environment. Although the result was not quite as stable as production, it gave a good idea of how production would behave.

Failing is important in testing. Failures help test teams identify real issues in downstream implementations. To surface them safely, functional validations were run as real canaries in production, exposing a small percentage of actual customer traffic to both versions of the API under test.

Canary analysis algorithms were run on the metrics gathered from these canaries, and a compare-verify regimen checked whether client and server metrics were equivalent. Failing request logs captured from the canaries helped to debug and triage issues better.
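
A heavily simplified version of such a comparison is sketched below. It is not Netflix's algorithm: production canary analysis typically relies on statistical tests over many samples, whereas this example only checks whether each metric's mean stays within a relative tolerance of the baseline. The function names, metric names and the 5% tolerance are assumptions.

```python
from statistics import mean


def canary_passes(baseline: list, canary: list, tolerance: float = 0.05) -> bool:
    """Pass when the canary mean is within `tolerance` (relative) of the baseline mean."""
    base = mean(baseline)
    if base == 0:
        return mean(canary) == 0
    return abs(mean(canary) - base) / abs(base) <= tolerance


def compare_metrics(baseline: dict, canary: dict, tolerance: float = 0.05) -> dict:
    """Compare every metric present in both groups; returns metric name -> pass/fail."""
    shared = baseline.keys() & canary.keys()
    return {
        name: canary_passes(baseline[name], canary[name], tolerance)
        for name in shared
    }


verdict = compare_metrics(
    baseline={"latency_ms": [100, 102], "error_rate": [0.01, 0.01]},
    canary={"latency_ms": [101, 103], "error_rate": [0.05, 0.05]},
)
```

In this made-up data the latency metric stays within tolerance while the error rate does not, so a rollout gated on `all(verdict.values())` would be stopped.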

Learnings

Learnings from such an approach are manifold. 

  • The first is to understand that test and prod are different, and that their differences must be embraced to make use of the capabilities of both.
  • Although testing in a sandboxed environment is good, testing in production is also important for such implementations.
  • Solving the problems in either environment can go a long way in ensuring test success.
  • Keep rethinking your testing strategy. Even if it comes at an extra cost, the end result will be worth it.
  • Find a pragmatic testing shape that is right for your company; do not look for a textbook shape to fit into.
  • Start production simulation and chaos experiments. These will help to validate your functional and resiliency testing capabilities for the future.

At Netflix, chaos testing is done at scale in production. They test for everything from fire raining from the sky to aliens killing their servers, and leave nothing to chance. If they don't leave anything to chance, why should you? The testing teams at Volumetree are experienced and reliable, and know where to raise the red flags. Give your software the quality edge it needs. Schedule a consultation with our test consultants today!
