Tracing

Notes on Co-evolving Tracing and Fault Injection with Box of Pain

This paper explores how related tracing and fault injection systems are, and if they should be part of the same thing. The space of possible executions of a distributed system is exponential in the number of communicating precesses and the number of messages, […] […] some of the most pernicious bugs in distributed programs involve mistakes on how programs handle partial failure of remote components. In order to expose this failures, fault injection mechanisms are used to cause network partitions, or machine crashes.
Read more
papers distributed systems tracing fault injection peter alvaro daniel bittman ethan l miller

Distributed Tracing with OpenTracing @ NoSlidesConf 2017

Presented at NoSlidesConf 2017
Read more
operations tracing

Tracing Kafka applications

for a more updated version, check https://jeqo.github.io/posts/2019-03-26-importance-of-distributed-tracing-for-apache-kafka-based-applications/ Tracing is one of the hardest time in integration or microservice development: knowing how a request impact your different components, and if your components have behave as expected. This could be fairly easy if we have monolith where we have one database and with some queries or checking one log file you can validate everything went well. Once you introduce distributed components and asynchronous communication this starts to get more complex and tedious.
Read more
integration observability kafka tracing