PLATFORM FOR BIG DATA, NOSQL
AND RELATIONAL DATA. WHAT
MAKES SENSE FOR ME?
(+AZURE)
michael.epprecht@microsoft.com
http://blogs.gartner.com/doug-
laney/files/2012/01/ad949-3D-Data-Management-Controlling-
Data-Volume-Velocity-and-Variety.pdf
RoadDesignator DrivingStatus
A1 Difficulties
Batch Processing Interactive Analysis Stream Processing
Query runtime Minutes to hours Milliseconds to minutes Never-ending
Data volume TBs to PBs GBs to PBs Continuous stream
Programming model MapReduce Queries DAG
Users Developers Analysts and developers Developers
Originating project Google MapReduce Google Dremel Twitter Storm
Open source project Hadoop / Spark Drill / Shark / Impala
Hbase
Storm / Apache S4 /Kafka
How do I optimize my
fleet based on weather
and traffic patterns?
What’s the social
sentiment for my
brand or products
How do I better
predict future
outcomes?
GAIN COMPETITIVE ADVANTAGE BY MOVING FIRST AND FAST IN YOUR INDUSTRY
Web app
optimization
Smart meter
monitoring
Equipment
monitoring
Advertising
analysis
Life sciences
research
Fraud
detection
Healthcare
outcomes
Weather
forecasting
Natural resource
exploration
Social network
analysis
Churn
analysis
Traffic flow
optimization
IT infrastructure
optimization
Legal
discovery
persistent | distributed
• In Memory
• Efficient at Random
Reads/Writes
• Distributed, large
scale data store
• Utilizes Hadoop for
persistence
• Both HBase and
Hadoop are
distributed
http://developer.yahoo.com/hadoop/tutorial/module4.html
MANAGE ANY DATA, ANY SIZE, ANYWHERE
010101010101010101
1010101010101010
01010101010101
101010101010
Distributed Storage
(HDFS)
Hadoop architecture.
Distributed Processing
(Map Reduce)
INSIGHTS FOR ALL USERS THROUGH FAMILIAR TOOLS
PB TB GB
An object contained within a user database
Defines the scheme for the federation
Represent the database being sharded
Database that houses the federation object
System managed SQL databases
Contain part, or “slices” of data
Orders_federation
Orders_federation
CREATE FEDERATION fed_name(fed_key_label fed_key_type distribution_type)
Orders_federation
Orders_federation
 The key used for data distribution
 int, bigint, guid, varbinary
 Represent a single instance of a federation key.
 All rows in all federated tables with the same federation key value.
PK=5 PK=25 PK=35
PK=5 PK=25 PK=35
Contains only atomic units for member’s key range
Non-Federated table
SalesDB
Orders_federation
Orders_federation
Orders_Fed
[5000, 10000)
ALTER FEDERATION Orders_Fed SPLIT AT (tenant_id=7500)
[5000, 7500) & [7500, 10000)
SPLIT members to spread workloads over to more nodes
DROP members to shrink back to fewer nodes
SalesDB
Orders_federation
Orders_federation
Orders_Fed
[5000, 7500) & [7500, 10000)
USE FEDERATION Orders_Fed (tenant_id=7509)
Ensure apps can discover where the data is just-in-time
No “Shard Map” caching
Guaranteed member routing
Table Details
MANAGE ANY DATA, ANY SIZE ANYWHERE
1010101010101010
01010101010101
101010101010
Hadoop Connectors & ETL
virtual machines web sites cloud services SQL database noSQL database blob storage connect virtual network traffic manager
.
.
.
Frameworks
Services
Fabric
Infrastructure
N Central US, S Central US, N Europe, W Europe, E Asia, SE
Asia + 24 Edge CDN Locations
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
Automated
Managed Resources
Elastic
Usage Based
Rest Fundamentals
Rest Fundamentals

Rest Fundamentals