About Tony Rogerson

Posts by Tony Rogerson:

Reducing SQL Server IO and Access Times using Bloom Filters – Part 1 (Concepts)

Given a 10 million row table with a GUID as a primary key, we have a 50,000 row table that we want to look up to see if we have any matching rows and for those matching rows aggregate the data – lets assuming that 50% of the rows have a corresponding match – so […]

Overview of HADOOP, new features in Version 1, Version 2 branch description

Introduction The essay assumes the reader has no knowledge of Hadoop or Map Reduce; it will give an overview of Hadoop and the confusion around the project branch that form v1.0.0 and v2.0.0, the essay will also give discussion of the features introduced in v1.0.0 and some of the use-cases that they can be applied […]

Erlang Recursion basics

I’m loving Erlang, extremely powerful as I get more into it, however, it did take a bit of getting my head round, but once you start to understand function pattern matching, data guards and recursion then you are on your way. This post is really for my own sanity, as a checkpoint if I get stuck […]

Erlang – Running Concurrent Queries against Microsoft SQL Server

Running Erlang on Windows, you can connect to Microsoft SQL Server using Erlang’s own ODBC application. It’s extremely easy and straightforward (example below), you can use a DSNless connection so you don’t even need to set up an entry through the Windows ODBC Data Source Administrator. 1 2 3 4 5 6 7 8 9 […]

Hard Disk Short-stroking (Part 2): RAID 0 (Striping) – Understanding why disk combinations matter

Introduction In part 1 (Understanding the physical performance characteristics of hard disks) we looked at why the physical properties of hard drives affects throughput i.e. more data can be read from tracks on the outer edge of the disk than the inner thus causing less disk head movement. In this article I will show what happens when we start […]

Short-stroking: Understanding the physical performance characteristics of Hard Disks (Part 1)

Article Series As part of my thesis for my Business Intelligence masters (see http://reportingbrick.com) I did a lot of research around Solid State and Hard Disk drives, this series of articles reflects that research. First I’ll talk about the physical properties of hard drives and progress through to its affect on how SQL Server performs […]

Welcome to Data Idol 0

Throughout my career the need by business to gain value from data has been a constant; I remember back in 93 writing EIS (Executive Information Systems) and MIS (Management Information Systems) from what today we term structured data, back then it was difficult because there weren’t many tools available; today that has changed, the so […]