Using MapReduce Technique to Process 500GB of Server Logs
(Originally posted on Stuart's MATLAB Videos blog)
Here I'm using the MapReduce functionality in Parallel Processing Toolbox™ to process several hundred GBs of server logs from our web site. I want to be able to visualize the counts per minute of certain quantities and also filter the data to look for certain special requests to our web site. I start small, getting my algorithm to work with one file first.
Recorded: 16 Mar 2018