Big Data

Reducing SQL Server IO and Access Times using Bloom Filters – Part 3 (Inserting Data)

Part 2 (Basics of the method in SQL Server) explained how to get data into a Bloom Filter structure; that structure now needs persisting. This post explains one way to store a Bloom Filter in a SQL Server database. I assume you have read Part 1 and Part 2 and understand about […]

Reducing SQL Server IO and Access Times using Bloom Filters – Part 2 (Basics of the method in SQL Server)

Part 1 addressed Bloom Filter concepts; if you haven’t already read it, it’s important to start there. In this post I will show the basics of how we set and query the bit array that holds our Bloom Filter structure. Step 1 – Hash the target data element (key): multiple hash functions are used over your […]

Overview of HADOOP, new features in Version 1, Version 2 branch description

Introduction: This essay assumes the reader has no knowledge of Hadoop or MapReduce. It gives an overview of Hadoop and addresses the confusion around the project branches that form v1.0.0 and v2.0.0. It also discusses the features introduced in v1.0.0 and some of the use-cases that they can be applied […]