Blog Posts

Convergence vs Consensus in Distributed Systems

Oct 20, 2025
Convergence and consensus are two closely related properties of distributed systems implementing the replicated state machine (RSM) abstraction. While convergence requires replicas to eventually agree on the value of a decision variable, consensus requires them to never disagree. This subtle distinction makes all the difference in practice: while convergence can be...
Read more →
Vagaries of Git Merge

Nov 25, 2019
Git’s merge algorithm seems to have inexplicable semantics, leading to some interesting cases. I describe a couple of examples below. The bottom line is that Git merge is inconsistent: the merge result seems to depend not just on which versions were merged, but also on how they were merged. Thus two branches...
Read more →
First-Class Modules and Modular Implicits in OCaml

Sep 25, 2017
I was pleasantly surprised to discover that OCaml has supported modules as first-class objects since v3.12 (2011). Intuition suggests that first-class modules should be expressive enough to simulate Haskell-style typeclasses in OCaml. Turns out this is the route taken by Leo White et al to introduce ad hoc polymorphism via...
Read more →
Effective Serializability for Eventual Consistency

Nov 6, 2016
This post is a collection of my notes on Lucas Brutschy et al’s paper “Effective Serializability for Eventual Consistency”. A later version of this paper has been accepted to POPL’17. Introduction Serializability is a well-understood criterion to reason about concurrent transactions. Enforcing serializability via pessimistic concurrency control techniques, such as...
Read more →
Dynamo and DynamoDB

Sep 20, 2016
In this post, I discuss DeCandia et al’s Dynamo paper, and Amazon’s DynamoDB service based on the paper. Dynamo DeCandia et al’s Dynamo is a distributed key-value store remarkable for its entirely decentralized architecture, SLAs that focus on 99.9th percentile latency, emphasis on never losing writes, and the notorious sloppy...
Read more →
Disciplined Inconsistency

Aug 19, 2016
Today, in our reading group, we read an interesting paper titled “Disciplined Inconsistency” by Brandon Holt et al from UW CSE. This post contains my notes on the paper. Background Modern-day web services often trade consistency for availability and performance. However, there exist some data and operations for which stronger...
Read more →
Extraction in Coq

May 31, 2016
Extraction erases Props Extraction in Coq works by erasing Props. For example, consider the following definition of div: Definition div (m n : nat)(H : n<>0): nat := NPeano.div m n. div expects a proof that its second argument is non-zero. Indeed, in Coq, it is impossible for div to...
Read more →
Notes - Terry's Session Guarantees

Dec 5, 2015
This post is a compilation of my notes on Terry et al’s PDIS’94 paper: Session Guarantees for Weakly Consistent Replicated Data. System Model From what I understand, the paper is the first to describe the abstract system model of a weakly consistent replicated database that now serves as a frame...
Read more →
Effing Package Management (FPM)

Oct 5, 2015
If you ever need to make Debian packages to distribute your software, check out this great package management tool called fpm by @jordansissel. FPM is a no-nonsense package manager that lets you create packages by simply specifying dependencies, and source and destination paths for your binaries, libraries and includes. Example Let...
Read more →
Understanding Transactions in Rails

Sep 30, 2015
In my previous post, I noted that Rails encourages application developers to rely on feral mechanisms, such as validations and associations, to ensure application integrity. In this post, I first explore various feral mechanisms in Rails, and how they are being used by some sample applications. Next, I will...
Read more →
Understanding Transactions in Quelea

Sep 28, 2015
Quelea is our eventually consistent data store with an associated programming framework intended to simplify programming under eventual consistency. In this post, I describe how various applications written in Quelea employ a combination of highly available and serializable transactions to enforce application integrity. Three applications participate in this survey: BankAccount:...
Read more →
Notes - Feral Concurrency Control

Sep 24, 2015
This post is a compilation of my notes on Peter Bailis et al’s SIGMOD’15 paper: Feral Concurrency Control: An Empirical Investigation of Modern Application Integrity. Background Modern relational DBMSs offer a range of primitives to help the developer ensure application integrity, even in the presence of concurrency: built-in integrity constraints...
Read more →
Atomicity vs Isolation

Jul 31, 2015
From the perspective of a transaction, Isolation: how should I see the effects of other transactions? Atomicity: how should other transactions see my effects?
Read more →
ML Type Inference

Jul 25, 2015
One of the most useful features of ML-family languages (OCaml, Standard ML, F#, etc.) is type inference. This post contains the notes I took when I was trying to understand the foundational principles of ML type inference. Damas-Milner Algorithm for Type Inference Let us take a look at the...
Read more →
SAT solving puzzles

May 10, 2015
A few months ago, Cheryl’s birthday puzzle was an internet phenomenon. If you found the puzzle tricky to solve, we are in the same boat. However, notice that the puzzle only requires us to apply simple logic; not number theory or complex arithmetic, just simple logic. What, then,...
Read more →
Notes - Static Contract Checking for Haskell

May 4, 2015
I latex’d some notes while reading Dana N. Xu et al’s POPL’09 paper: Static contract checking for Haskell.
Read more →
Notes - A Data-Driven Approach for Algebraic Loop Invariants

Jan 15, 2015
This post contains my notes on Rahul Sharma et al’s ESOP’13 paper: A Data-Driven Approach for Algebraic Loop Invariants. The paper proposes a neat approach of inferring algebraic loop invariants by observing concrete program states, and making use of techniques from linear algebra to gain insights. Following is an example...
Read more →
Sequential Consistency and Datarace freedom in Weak Memory Models

Nov 30, 2014
A natural view of execution of a multi-threaded program is as follows: exn = [] while (there is an unfinished thread) { t = select an unfinished thread; instr = first(t); exn := exn ++ [instr]; t := rest(t); } return exn; Observe that exn preserves program order of all...
Read more →
SC vs Linearizability

Sep 23, 2014
Sequential consistency requires that all data operations appear to have executed atomically in some sequential order that is consistent with the order seen at every individual process. If instead of individual data operations, we apply sequential consistency to transactions, the resultant condition is called serializability in database theory. Linearizability imposes...
Read more →
Notes - McCarthy's Lisp and Reynolds's Definitional Interpreters

Sep 15, 2014
This post is a compilation of my notes on two landmark papers: The Lisp paper: McCarthy J, Recursive Functions of Symbolic Expressions and Their Execution by Machine, CACM’60; Interpreters paper: Reynolds J C, Definitional Interpreters for Higher-Order Programming Languages, ACM’72. 1. The Lisp Paper This paper by John McCarthy introduces the...
Read more →
CAP Theorem and Related

Sep 9, 2014
My intention in writing this note is to understand the relation between the conventional model of distributed systems that is usually taught in distributed systems courses and the distributed web services hosting replicated datatypes. Fault tolerance is a concern in the former, and it is studied separately from communication...
Read more →
Coq Basics

Apr 9, 2014
GADTs vs Inductive Datatypes Consider the following definition of Nat-indexed Vector GADT in OCaml and Haskell: OCaml: (* vec : * -> * -> * *) type (_,_) vec = | Nil : ('a,zero) vec | Cons : 'a * ('a,'b) vec -> ('a,'b succ) vec Haskell: data Vec ::...
Read more →