DSAN 6000: Big Data and Cloud Computing
Fall 2025
Tuesday, September 2, 2025
Can be processed on single machine? | Can be stored on single machine? Yes | Can be stored on single machine? No |
---|---|---|
No | Medium (Parallel Processing) | Big! Parallel + Distributed Processing |
Yes | Small (Your Laptop) | Medium (Data Streaming) |
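The four cells of the table above can be read as a lookup on two yes/no questions. A minimal sketch in Python, assuming a hypothetical helper name (`classify_dataset`) that is not part of the course materials:

```python
def classify_dataset(can_store: bool, can_process: bool) -> str:
    """Return the table cell for a dataset.

    can_store:   can the data be stored on a single machine?
    can_process: can the data be processed on a single machine?
    (Hypothetical helper for illustration; the labels are copied from the table.)
    """
    if can_store and can_process:
        return "Small (Your Laptop)"
    if can_store:
        # Fits on one machine's storage, but one machine cannot process it.
        return "Medium (Parallel Processing)"
    if can_process:
        # Too large to store on one machine, but one machine can process it.
        return "Medium (Data Streaming)"
    return "Big! Parallel + Distributed Processing"


if __name__ == "__main__":
    for store in (True, False):
        for process in (True, False):
            print(f"store={store!s:5} process={process!s:5} -> "
                  f"{classify_dataset(store, process)}")
```

In practice, answering either question depends on the hardware and time budget at hand, so the two booleans are judgment calls rather than fixed thresholds.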
Big data is when the size of the data itself becomes part of the problem (Loukides 2010)
(Some of y'all would fold in this scenario)
Examples and Use Cases
The SaaS Revolution
From Gaming to AI
Cloud GPU Offerings
# AWS GPU instances for different AI workloads (approximate on-demand prices)
p4d.24xlarge   # 8x NVIDIA A100 (40GB) - ~$32/hour
p3.16xlarge    # 8x NVIDIA V100 (16GB) - ~$24/hour
g4dn.xlarge    # 1x NVIDIA T4 (16GB)   - ~$0.50/hour
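To compare these offerings, it can help to normalize the hourly price by GPU count. A rough sketch in Python, using only the rounded figures from the listing above; the names `INSTANCES`, `cost_per_gpu_hour`, and `job_cost` are hypothetical, and actual AWS prices vary by region and over time:

```python
# (GPU count, approximate USD per hour), copied from the slide's rounded figures.
# These are illustrative numbers, not a live price list.
INSTANCES = {
    "p4d.24xlarge": (8, 32.00),  # 8x NVIDIA A100 (40GB)
    "p3.16xlarge": (8, 24.00),   # 8x NVIDIA V100 (16GB)
    "g4dn.xlarge": (1, 0.50),    # 1x NVIDIA T4 (16GB)
}


def cost_per_gpu_hour(instance: str) -> float:
    """Hourly instance price divided by the number of GPUs on the instance."""
    gpus, price_per_hour = INSTANCES[instance]
    return price_per_hour / gpus


def job_cost(instance: str, hours: float) -> float:
    """Total cost of keeping one instance running for the given number of hours."""
    _, price_per_hour = INSTANCES[instance]
    return price_per_hour * hours


if __name__ == "__main__":
    for name in INSTANCES:
        print(f"{name}: ${cost_per_gpu_hour(name):.2f} per GPU-hour")
    print(f"24h on p4d.24xlarge: ${job_cost('p4d.24xlarge', 24):.2f}")
```

Per-GPU price alone does not capture throughput: a faster GPU can finish the same job in fewer hours, so the cheaper hourly rate is not necessarily the cheaper job.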
The Scale Challenge
Modern AI Requires Massive Clusters
Technologies Enabling Scale
The Data Tsunami
Storage Technologies
The Economics of Data
Yesterday | Today | Tomorrow |
---|---|---|
Limited number of tools and vendors | Many tools and vendors to work with | Integrated tools and vendors |
One platform - few devices | Multiple platforms - many devices | Connected platforms and devices |
Data is scarce but manageable | Overabundance of data | Data is used for important business decisions |
IT has major influence and control | IT has limited influence and control | IT is strategic to the business |
People only work when they are at work | People work wherever they want | People have access to what they need wherever they are |
SaaS and Cloud Computing Statistics:
DSAN 6000 Week 2: Cloud Computing