Tag Archives

3 Articles

Posted by admin on

Data analytics and democracy

Today, privacy campaigners have been circulating tweets about Brittany Kaiser’s document dump proving that Cambridge Analytics and AIG were working as a single entity. This would seem somewhat contrary to what the police and the ICO have been able to find. There’s more to find here and when I find it, I’ll post the stories here.

Posted by admin on

Big Data

I am once again trying to write my blog on solutions architecture and the GDPR. I looked up “Data Lake” again and came across some very good resources in a You Tube channel from Intricity.

This summarises the design bifurcation between distributed data sources and unified query logic. It’s five years old. He or his teachers got there first i.e. before me.
I also had a quick look at “Born in the Cloud”, and “Why Hadoop is dying!”
All very insightful.

Posted by admin on

Apache Flume

Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data. It has a simple and flexible architecture based on streaming data flows. It is robust and fault tolerant with tunable reliability mechanisms and many failover and recovery mechanisms. It uses a simple extensible data model that allows for online analytic application. I need to check this out.

By continuing to use the site, you agree to the use of cookies. more information

The cookie settings on this website are set to "allow cookies" to give you the best browsing experience possible. If you continue to use this website without changing your cookie settings or you click "Accept" below then you are consenting to this.

Close