Event processing, time series aggregation and analysis, and finally analysis of structural patterns between those data snippets can all be done on Hadoop clusters on huge data volumes.
In order to find hidden relations and invisible structures one has to combine three disciplines using a variety of tools. Luckily, the Hadoop ecosystem offers many of such tools. In this session you can see practical examples and a demonstration of the "Hadoop-Oscilloscope". Generic analysis patterns and recommendations towards selection of appropriate algorithms will also provide additional background.
All starts with a question / problem ?
How has …. Changed (descriptive) ?
What will happen if .... Changes ? (impact)
How will .... evolve? (forecast)
Domain Knowledge and ituition help us to get a starting point
TSA: offers multiple specialities, one has to select the right incredients
Source for info is:
Measured data from ….
http://images.google.de/imgres?imgurl=http%3A%2F%2F3.bp.blogspot.com%2F-tEkIR2kcyCY%2FVEcQJGrqb3I%2FAAAAAAAAABU%2F9Nj4hxeuqa0%2Fs1600%2FTHAI1.jpg&imgrefurl=http%3A%2F%2Fkonwersatorium1-ms-pjwstk.blogspot.com%2F2014%2F10%2Fthe-human-artificial-intelligence_22.html&h=958&w=965&tbnid=WscyQ01kH-s7CM%3A&docid=sGVehcJYs2-e1M&ei=gy6aV4zmJMX1UqSwsYAO&tbm=isch&iact=rc&uact=3&dur=774&page=1&start=0&ndsp=36&ved=0ahUKEwjMs_6BxpbOAhXFuhQKHSRYDOAQMwhEKAowCg&bih=1058&biw=1804
https://openclipart.org/download/242296/remix-fossasia-2016-contest4.svg
Results tell us about very specific properties of the system:
Lets look into a thermodynamics:
http://images.google.de/imgres?imgurl=http%3A%2F%2F3.bp.blogspot.com%2F-tEkIR2kcyCY%2FVEcQJGrqb3I%2FAAAAAAAAABU%2F9Nj4hxeuqa0%2Fs1600%2FTHAI1.jpg&imgrefurl=http%3A%2F%2Fkonwersatorium1-ms-pjwstk.blogspot.com%2F2014%2F10%2Fthe-human-artificial-intelligence_22.html&h=958&w=965&tbnid=WscyQ01kH-s7CM%3A&docid=sGVehcJYs2-e1M&ei=gy6aV4zmJMX1UqSwsYAO&tbm=isch&iact=rc&uact=3&dur=774&page=1&start=0&ndsp=36&ved=0ahUKEwjMs_6BxpbOAhXFuhQKHSRYDOAQMwhEKAowCg&bih=1058&biw=1804
https://openclipart.org/download/242296/remix-fossasia-2016-contest4.svg
Results tell us about very specific properties of the system:
Lets look into a thermodynamics:
http://images.google.de/imgres?imgurl=http%3A%2F%2F3.bp.blogspot.com%2F-tEkIR2kcyCY%2FVEcQJGrqb3I%2FAAAAAAAAABU%2F9Nj4hxeuqa0%2Fs1600%2FTHAI1.jpg&imgrefurl=http%3A%2F%2Fkonwersatorium1-ms-pjwstk.blogspot.com%2F2014%2F10%2Fthe-human-artificial-intelligence_22.html&h=958&w=965&tbnid=WscyQ01kH-s7CM%3A&docid=sGVehcJYs2-e1M&ei=gy6aV4zmJMX1UqSwsYAO&tbm=isch&iact=rc&uact=3&dur=774&page=1&start=0&ndsp=36&ved=0ahUKEwjMs_6BxpbOAhXFuhQKHSRYDOAQMwhEKAowCg&bih=1058&biw=1804
https://openclipart.org/download/242296/remix-fossasia-2016-contest4.svg
There are some open questions … (see yellow bubble)
The ARIMA model can be viewed as a "cascade" of two models. The first is non-stationary:
{\displaystyle Y_{t}=\left(1-L\right)^{d}X_{t}}while the second is wide-sense stationary:
{\displaystyle \left(1-\sum _{i=1}^{p}\phi _{i}L^{i}\right)Y_{t}=\left(1+\sum _{i=1}^{q}\theta _{i}L^{i}\right)\varepsilon _{t}\,.}Now forecasts can be made for the process {\displaystyle Y_{t}}, using a generalization of the method of autoregressive forecasting.
The ARIMA model can be viewed as a "cascade" of two models. The first is non-stationary:
{\displaystyle Y_{t}=\left(1-L\right)^{d}X_{t}}while the second is wide-sense stationary:
{\displaystyle \left(1-\sum _{i=1}^{p}\phi _{i}L^{i}\right)Y_{t}=\left(1+\sum _{i=1}^{q}\theta _{i}L^{i}\right)\varepsilon _{t}\,.}Now forecasts can be made for the process {\displaystyle Y_{t}}, using a generalization of the method of autoregressive forecasting.