Archive

Uncategorized

In stealth no more, announcing Datanet: http://datanet.co/.

Datanet is an open source CRDT based data synchronization system.

Datanet aims to achieve ubiquitous write though caching. CRDT replication can be added to any cache in your stack, meaning data modifications to these stacks are globally & reliably replicated. Locally modifying data yields massive gains in latency, produces a more efficient replication stream, & is extremely robust. It’s time to prefetch data to compute.

This is the culmination of 2+ years of research and development and I am pretty proud of it 🙂

Datanet links:

website: http://datanet.co/
github: https://github.com/JakSprats/datanet
community: https://groups.google.com/forum/#!forum/crdtdatanet
twitter: https://twitter.com/CRDTDatanet

Author links:

twitter: https://twitter.com/jaksprats
linkedin: http://www.linkedin.com/in/russell-sullivan-a266096
blog: https://jaksprats.wordpress.com/

Quick link to slides

This is a concept I came up with in Jan 2014. Rumors of Google doing something similar to this concept started popping up recently, so it seemed like a good time to publish it.

Basic concept is to build a secondary cell network that complements the user’s existing (primary) cell network, offering reduced cellular data rates when a user is within the secondary cell network’s range. This secondary network would be significantly cheaper to construct as two major costs (backhaul & spectrum) are done more or less for free, by using Google Fiber as the backhaul and TV White Spaces & all unlicensed spectrum( 900MHz, 2.4 GHz, 5GHz) combined with best Cognitive radio practices for the network’s spectrum. Select Google Fiber customers would receive reduced Google Fiber rates for hosting the network’s Picocells. Additionally, each Google Fiber home would receive proprietary hardware in the form of a Wifi-router to implement the system’s Cognitive Radio. This proprietary hardware can be used to do other cool stuff.

But without further ado … here are the slides … the concept is a pretty cool one, pretty relevant, probably doable, and I expect some derivation of it to happen in real life pretty soon cuz Google has the rep for pushing things forward.

Quick link to slides

Short link for PRESENTATION

I am a huge fan of Google Glass’ concept. The idea of having information pop up that only you can see to add context to whatever is in your line of sight is a wildly interesting technology.

I bought a Google Glass about 4 months ago and was totally OK w/ paying $1500. Since purchasing the Glass I have swayed between being euphoric about some features (e.g. POV camera) and disappointed about the product’s many shortcomings.

Recently I started cataloging all the changes I would make to Glass and wrote up a (not perfectly organized) power point presentation. More recently Google announced and released Android wear, which is very much in line with some of my suggestions, so I felt validated that my ideas were relevant and thought I should share them.

Hopefully some of my other ideas are in line w/ what Google is thinking, because a technology like a Glass is a certainty to be in your household, but not in its current form.

Without further ado … CLICK FOR PRESENTATION

 

* On re-reading my presentation, it is pretty poorly organized. The interesting ideas start around page 20 and the even better ones start at page 30, which is to say the first 20 are boring 🙂

I wrote a blog on highscalability.com on how to get to 1 million Database TPS on $5000 worth of hardware (single machine). Hopefully there are some performance tips in the blog that people can use themselves.

http://highscalability.com/blog/2012/9/10/russ-10-ingredient-recipe-for-making-1-million-tps-on-5k-har.html

I also wrote a brag number post for Aerospike’s blog, and they let me quote the movie “Ricky Bobby”

http://www.aerospike.com/blog/all-about-speed/

I will update this blog every time I write blogs in other places, just seems like a  good idea

– Russ

Aerospike is the former Citrusleaf: http://www.dbms2.com/2012/08/27/aerospike-the-former-citrusleaf/

Citrusleaf acquired AlchemyDB and we are now incrementally porting AlchemyDB functionality to run on top of Citrusleaf’s proven distributed high-availability linearly-scalable key-value store. First functionalities planned are: Lua with DocumentStore functionality, Secondary Indexes, and Real-time map-reduce. Further down the road Pregel like Distributed GraphDB like functionality or some next generation StreamingDB may be integrated in. Incrementally building AlchemyDB on top of Citrusleaf will create a distributed computing fabric, functions can be shipped to data (that lies on a horizontally scalable low latency storage layer), and the functions can propagate their results across the fabric, calling other functions on them.

Full info at: http://www.aerospike.com/blog/alchemydb/

I get my information on databases, datastores, big-data, etc… primarily from 3 bloggers: Todd Hoff of High Scalability, Alex Popescu of myNOSQL, and Curt Monash of DBMS2. Each of them specialises in different areas and each has their own style and purpose, taken as a whole, they cover a lot of ground w/o a lot of dilution.

In 2-10 years you will look back at Todd Hoff’s blog High Scalability and it will contain everything that is happening right now. He has a keen insight into which technologies are fads and which technologies may lead to big changes, and he is not at all full of shit, which is next to impossible given this task. He dabbles in reporting on truly innovative technologies and he actually understands them. His style is strictly-facts and he rarely bad mouths people/ideas/stuff. His blog is the remedy for the hardened cynic that thinks we are making no important advances. His weekly “Stuff the Internet says on Scalability”, is ALWAYS good for at least 3 links to stuff I find interesting (which is 3 links higher than every other summarized email I get).

Alex Popescu is the go to guy for NOSQL. Alex Popescu’s blog myNSOSQL covers the ENTIRE NOSQL gambit (GraphDB’s, DocumentStores, KeyValueStores, Hadoop/Mapreduce, etc…). He quickly sees thru most of the bullshit in the NOSQL world, and clearly explains the differences in a movement that is full of confusion. He likes to tear people’s points apart, his points are valid and he also blasts NOSQL ideas/approaches/etc… when they have it coming. His tweet stream (@al3xandru) is a raging river of NOSQL information, it will keep you on top of the NOSQL game (once you learn how to wade thru it), plug into it, and you can be up on most of NOSQL in probably a month.

Curt Monash knows the RDBMS market, especially the analytics market, better than you know anything 🙂 He has been in the game forever, and he earns money w/ his blog DBMS2, so it cant be called 100% objective, but he has the type of abrasive personality that is only comfortable telling mostly the truth, so IMO the info in his blog is basically objective. RDBMS technologies are relatively very mature, advanced, and widespread, so having a good summary of what is going on in that market is a MUST for any fan of data. Monash is a definitions junky, which can be boring/tiring to read, but it represents a mature approach and does help make sense of such a large, complicated, and polluted-by-enterprise-generated-bullshit market. Having a guy w/ such experience who is still very up to date, reporting on one of the oldest (yet most active) fields in computing is of great value to all of us.

In conclusion:
Real smart people are reading these 3 blogs, learning what other real smart people are doing, and forming new even smarter ideas. For this to happen a medium of information exchange that does not waste smart peoples’ time, doesn’t insult smart peoples’ intelligence w/ obvious marketing ploys, and is written in a style that sparks their imagination, is required. So go read their blogs, you will learn stuff 🙂

I have always not done a blog on purpose.

I have always been of the opinion that while my opinions are indeed unique and interesting, I really only want to share the absolute best of them and blogs are notorious for people just going on about nothing. So I will do my best to keep the content intelligent and interesting, which will take time, which is the second reason I never did a blog, they take time.

And with that, its started.