<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Blog on Siddharth Jain</title><link>https://www.siddharthjain.dev/posts/</link><description>Recent content in Blog on Siddharth Jain</description><generator>Hugo -- gohugo.io</generator><lastBuildDate>Wed, 27 Oct 2021 21:31:30 -0700</lastBuildDate><atom:link href="https://www.siddharthjain.dev/posts/index.xml" rel="self" type="application/rss+xml"/><item><title>Zanzibar: Google’s Consistent, Global Authorization System</title><link>https://www.siddharthjain.dev/posts/2021/zanzibar-consistent-auth-system/</link><pubDate>Wed, 27 Oct 2021 21:31:30 -0700</pubDate><guid>https://www.siddharthjain.dev/posts/2021/zanzibar-consistent-auth-system/</guid><description>Zanzibar is an authorization system developed at Google and used by more than 1500 client services, including Calendar, Cloud, Drive, Maps, Photos, and Youtube</description></item><item><title>FoundationDB: A Distributed Unbundled Transactional Key Value Store</title><link>https://www.siddharthjain.dev/posts/2021/foundationdb-transactional-key-value-store/</link><pubDate>Tue, 12 Oct 2021 20:45:09 -0700</pubDate><guid>https://www.siddharthjain.dev/posts/2021/foundationdb-transactional-key-value-store/</guid><description>FoundationDB is an open source key-value store and one of the first systems to combine the flexibility and scalability of NoSQL architectures with the power of ACID transactions.</description></item><item><title>Database Data Structures: LSM-Tree</title><link>https://www.siddharthjain.dev/posts/2020/designing-data-intensive-applications-sstables-lsm-trees-b-trees/</link><pubDate>Sat, 24 Oct 2020 19:29:58 -0700</pubDate><guid>https://www.siddharthjain.dev/posts/2020/designing-data-intensive-applications-sstables-lsm-trees-b-trees/</guid><description>An LSM-Tree is at the heart of storage engines that are based on the principle of merging and compacting sorted files. Such storage engines are typically called LSM storage engines.</description></item><item><title>Database Data Structures: Log-Structured Hash Table</title><link>https://www.siddharthjain.dev/posts/2020/designing-data-intensive-applications-hash-indexes/</link><pubDate>Sat, 17 Oct 2020 11:32:24 -0700</pubDate><guid>https://www.siddharthjain.dev/posts/2020/designing-data-intensive-applications-hash-indexes/</guid><description>Many databases internally use a &lt;em>log&lt;/em> which is an append-only sequence of records. These are &lt;em>log-structured&lt;/em> databases. Hash indexes are used to index key-value data, and may be used to power log-structured databases.</description></item><item><title>Data-Intensive Applications: Reliability, Scalability and Maintainability</title><link>https://www.siddharthjain.dev/posts/2020/designing-data-intensive-applications-reliability-scalability-maintainability/</link><pubDate>Fri, 02 Oct 2020 19:18:33 -0700</pubDate><guid>https://www.siddharthjain.dev/posts/2020/designing-data-intensive-applications-reliability-scalability-maintainability/</guid><description>A data-intensive application is one for which raw CPU power is rarely a limiting factor — it has bigger concerns over the amount of data, complexity of data and the speed at which the data is changing.</description></item><item><title>Information Retrieval : Basics</title><link>https://www.siddharthjain.dev/posts/2020/information-retrieval-part-1-boolean-retrieval/</link><pubDate>Sun, 05 Jul 2020 22:48:33 -0700</pubDate><guid>https://www.siddharthjain.dev/posts/2020/information-retrieval-part-1-boolean-retrieval/</guid><description>Information Retrieval (IR) is the science of searching for material in a collection of resources based on an information need. The most common form of an IR system is a text retrieval system that allows users to retrieve textual data in the form of a sorted list of documents using natural language queries.</description></item><item><title>ZooKeeper: Wait-free coordination for Internet-scale systems</title><link>https://www.siddharthjain.dev/posts/2020/zookeeper-wait-free-coordination-for-internet-scale-systems/</link><pubDate>Sun, 10 May 2020 21:49:20 -0700</pubDate><guid>https://www.siddharthjain.dev/posts/2020/zookeeper-wait-free-coordination-for-internet-scale-systems/</guid><description>ZooKeeper is a service that allows distributed processes to coordinate with each other using a shared name space of data registers. It exposes a wait-free interface and an event-driven mechanism to provide a simple and high-performance kernel for building distributed applications. Originally developed at &lt;em>Yahoo!&lt;/em>, it is now under the care of the &lt;em>Apache Software Foundation&lt;/em>.</description></item><item><title>Epidemic Algorithms for Replicated Database Maintenance</title><link>https://www.siddharthjain.dev/posts/2020/epidemic-algorithms-for-replicated-database-maintenance/</link><pubDate>Thu, 23 Apr 2020 22:29:39 -0700</pubDate><guid>https://www.siddharthjain.dev/posts/2020/epidemic-algorithms-for-replicated-database-maintenance/</guid><description>This post is a summary of the paper:
Epidemic algorithms for replicated database maintenance, Demers, et al., 1988
Introduction Published in the late 80&amp;rsquo;s, this paper lays out early ideas on gossip based replication algorithms and focuses on eventual consistency, in contrast to the traditional ACID model. The idea of eventual consistency or BASE is well-known these days, however, in the late 80&amp;rsquo;s, it was one of the most important novel ideas.</description></item><item><title>Web Scale Responsive Visual Search at Bing</title><link>https://www.siddharthjain.dev/posts/2020/web-scale-responsive-visual-search-at-bing/</link><pubDate>Sun, 23 Feb 2020 23:01:45 -0700</pubDate><guid>https://www.siddharthjain.dev/posts/2020/web-scale-responsive-visual-search-at-bing/</guid><description>This post is a summary of the paper:
Web-Scale Responsive Visual Search at Bing, Hu et al., 2018
Introduction Visual search is an interesting research area. A visual search system ranks a list of visually similar images when presented with a query image.
As in all search systems, latency and relevance of returned results are key metrics to evaluate such a system.</description></item><item><title>Unravelling C++ constexpr</title><link>https://www.siddharthjain.dev/posts/2020/unravelling-constexpr/</link><pubDate>Tue, 28 Jan 2020 20:24:04 -0700</pubDate><guid>https://www.siddharthjain.dev/posts/2020/unravelling-constexpr/</guid><description>Background C++ is an old language.
The excitement that C++ generated in the 90&amp;rsquo;s had faded by 2010, probably because of the emergence of several newer languages like Java and also because the Standards Committee had released only a few enhancements in the decade.
And then came C++11. The specification for C++11 was approved in 2011 (hence, the name).</description></item><item><title>Peloton: Resource Scheduling at Uber</title><link>https://www.siddharthjain.dev/posts/2019/peloton-resource-scheduling-at-uber/</link><pubDate>Sun, 15 Dec 2019 20:30:32 -0700</pubDate><guid>https://www.siddharthjain.dev/posts/2019/peloton-resource-scheduling-at-uber/</guid><description>Peloton is Uber&amp;rsquo;s cluster scheduler that is capable of co-scheduling mixed types of workloads such as batch, stateless and stateful jobs in a single cluster for better resource utilization. Designed for scaling to millions of containers and tens of thousands of nodes, it features advanced resource management capabilities like elastic resource sharing, hierarchical max-min fairness, workload preemption to name a few.</description></item><item><title>The Anatomy of a Large-Scale Hypertextual Web Search Engine</title><link>https://www.siddharthjain.dev/posts/2019/anatomy-of-large-scale-hypertextual-web-search-engine/</link><pubDate>Wed, 06 Feb 2019 22:43:09 -0700</pubDate><guid>https://www.siddharthjain.dev/posts/2019/anatomy-of-large-scale-hypertextual-web-search-engine/</guid><description>This is a well-known paper. It was published in 1998 and it describes Google as a prototype of a large-scale search engine. According to the authors, this paper was the first in-depth public description of a large-scale web search engine. The authors describe how to build a practical large-scale search system that exploits information present in hypertext, thus producing better search results than existing search systems.</description></item></channel></rss>