by Clemens Siebler | May 22, 2018 | Big Data
StorageGRID Webscale 11 now supports native integration into the AWS cloud. This includes cross-region replication to AWS S3 using CloudMirror, triggering of notifications to Simple Notification Service (via SNS), as well as metadata streaming into Elasticsearch for...
by Clemens Siebler | Nov 28, 2017 | Big Data
tl;dr Performing metadata search on billions of objects is now possible with StorageGRID Webscale by streaming object metadata into Elasticsearch. Introduction One advantage of using object over block and file storage is that data can be enriched with metadata and...
by Clemens Siebler | Apr 7, 2017 | Big Data
Introduction In this post we will show how to deploy a "stateless" Apache Spark cluster on Kubernetes. Spark is a fast analytics engine designed for large-scale data processing. Furthermore, we will then run analytics queries against data sitting in S3, in our case...