In 1996, the US government decided that large numbers of its nuclear weapons would require replacement, refurbishing, or decommissioning. Accordingly, the Department of Energy set up a refurbishment program aimed at extending the service lives of older nuclear weapons. In 2000, the National Nuclear Security Administration (NNSA) specified a life-extension program for W76 warheads that would enable them to remain in service until at least 2040.[2]
It was soon realized that the FOGBANK material was a potential source of problems for the program: few records of its manufacturing process had been retained when it was originally made in the 1980s, and nearly all staff members with expertise in its production had retired or left the agency. The NNSA briefly investigated sourcing a substitute for FOGBANK, but eventually decided that since FOGBANK had been produced before, it could be produced again.[2] Additionally, "Los Alamos computer simulations at that time were not sophisticated enough to determine conclusively that an alternate material would function as effectively as Fogbank," according to a Los Alamos publication.[3]
Another benefit of this approach is that it allows you to save storage space by eliminating the need to create a copy of the JSON data in the DB's own internal format.
It's genuinely refreshing to have a senator who isn't embarrassed to admit that his newest legislative proposal is inspired by the plot of a Hollywood thriller.
Several years ago I had the idea of using Markov chain algorithms to auto-generate Thomas Friedman articles. I was going to build a site around it called mechanicalfriedman.com. I just discovered someone else beat me to it:
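For anyone curious, the core of such a generator is tiny. A minimal sketch of a first-order, word-level Markov chain (the function names and toy corpus here are mine, not from any actual site):

```python
import random
from collections import defaultdict

def build_chain(text):
    """Map each word to the list of words that follow it in the corpus."""
    words = text.split()
    chain = defaultdict(list)
    for current, following in zip(words, words[1:]):
        chain[current].append(following)
    return chain

def generate(chain, start, length=10):
    """Walk the chain from `start`, picking a random successor each step."""
    out = [start]
    for _ in range(length - 1):
        followers = chain.get(out[-1])
        if not followers:
            break  # dead end: no word ever followed this one
        out.append(random.choice(followers))
    return " ".join(out)
```

Real generators typically use order-2 or order-3 chains (keying on the previous two or three words) to get output that reads more like the source author.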
Author here. Believe it or not, I originally had the compression ratio graph rotated 90 degrees, and had manually modified it to run from 0.00 to 1.00. Google Docs, for some god-awful reason, insists on starting at 0.2 by default. Anyway, when my colleagues reviewed a draft of this post they asked me to rotate the graph back, and in the process I forgot to reset the scale. Sorry for the confusion. It's fixed now. As for the definition of "compression ratio", I looked this up and went with the definition found here: http://en.wikipedia.org/wiki/Data_compression_ratio
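For concreteness, that definition is uncompressed size divided by compressed size, so higher is better. A quick illustration (zlib here is just a stand-in codec, not the compressor benchmarked in the post):

```python
import zlib

data = b"All work and no play makes Jack a dull boy. " * 1000
compressed = zlib.compress(data)

# Compression ratio per the Wikipedia definition: uncompressed / compressed.
ratio = len(data) / len(compressed)
print(f"{len(data)} -> {len(compressed)} bytes, ratio {ratio:.1f}")
```

Worth noting that some tools report the reciprocal (compressed divided by uncompressed), which falls between 0 and 1, so it pays to check which convention a given graph is using.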
Each of the seven queries we used in our benchmark required a sequential scan of the 32GB dataset. It's unlikely that the ARC had any impact on the results since the EC2 instance had only 7GiB of memory.
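A toy model of why: a repeated sequential scan over a working set larger than the cache gets essentially nothing from a plain LRU cache, since each block is evicted before it's needed again. (The ARC is more scan-resistant than LRU, but it still can't hold a 32GB scan in 7GiB.) A sketch with made-up unit-sized "blocks":

```python
from collections import OrderedDict

def lru_hits(num_blocks, cache_size, passes=2):
    """Count cache hits for `passes` sequential scans over `num_blocks`
    blocks through an LRU cache that holds `cache_size` blocks."""
    cache = OrderedDict()
    hits = 0
    for _ in range(passes):
        for b in range(num_blocks):
            if b in cache:
                hits += 1
                cache.move_to_end(b)  # refresh recency on a hit
            else:
                cache[b] = True
                if len(cache) > cache_size:
                    cache.popitem(last=False)  # evict least recently used
    return hits

# 32 "GB" scanned repeatedly through a 7 "GB" cache: every access misses.
print(lru_hits(32, 7))  # 0
```

If the working set fit in the cache, the second pass would be all hits; with the dataset roughly 4.5x the size of RAM, even a perfect cache could only hold a small fraction of it.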
I wasn't aware that Reiser4 supported compression. Thanks for pointing that out. As for why we chose ZFS over Btrfs: we feel that ZFS is closer to a state where an enterprise customer would be comfortable deploying it in production. ZFS has been in development for over a decade, with many Solaris sites already running it in production, while Btrfs is still marked as "unstable".
EDIT: I realize you said "near" and "closer" to production ready, but I think it's worth mentioning --
No FUD intended, but I don't consider ZFS on Linux production ready. Wanting to use ZFS, I recently started regularly reading their GitHub issues.
There are deadlocks and un-importable pools in certain situations (hard links being one; think rsync). I would not want production boxes in the same predicaments experienced by several bug reporters. Moreover, applying debug and (hopefully) hot-fix kernel patches, with the associated downtime, is a no-go for me in production.
Mind you, the project leads are very responsive and it's making great strides.
In addition, I believe the Linux implementation currently lacks the L2ARC (which can make ZFS really fly, caching to SSDs).
However, I would absolutely run ZFS on Illumos or Solaris, for the stability and for the compression benefits mentioned in the article.
I'm using ZFS with L2ARC and write logs on an SSD on Ubuntu right now. Not sure I'd use it in production yet for the reasons you mention, but for things like my home workstation and office NAS it works great!
While I can't claim that we logged CPU load while running these tests, I can say that I watched the output of top and iotop and that the CPU load was relatively light. It's also worth pointing out that Amazon describes the I/O performance of c1.xlarge instances as "high". We also considered using an hs1.8xlarge "High Storage" instance for these tests, but eventually decided that we were more interested in testing against conventional disks as opposed to SSDs.