Abstract
Industry 4.0 proposes the integration of the new generation of ICT solutions for the monitoring, adaptation, simulation, and optimisation of factories. With the democratization of sensors and actuators, factories and machine tools can now be sensorized and the data generated by these devices can be exploited, for instance, to optimize the utilization of the machines as well as their operation and maintenance. However, analysing the vast amount of data generated is resource demanding both in term of computing power and network bandwidth, thus requiring highly scalable solutions. This paper presents a novel big data platform for the management of machine generated data in the cloud. It brings together standard open source technologies which can be adapted to and deployed on different cloud infrastructures, hence reducing costs, minimising deployment difficulty and providing on-demand access to a virtually infinite set of computing, storage and network resources.