Designing Data Visualizations
Zurich	Region
8	people:	formerly	
Catalyst	Lab,	data	
science,	Incubator
Boston
12	people:	data	
science,	data	viz,	
MIT
London	
7	people:	data	
science,	data	viz,	
startups,	Imperial	
College
Cape	Town	
4	people:	data	
science,	data	viz,	
startups
TR	Labs	Formula
Startup	
Ecosystem
University	
Partner
Customer	Access
+ +
Waterloo
7	people:	data	
science,	data	viz,	
startups,	University	
of	Waterloo
Singapore
4	people:	
Launched	Feb	
2017,	focus	on	
FinTech
San	Francisco
12	people:	
Formerly	
StarMine,	data	
science,	quants
THOMSON REUTERS LABS
THREE HORIZONS
• Data Experience Developer at Thomson Reuters Labs
• Avid kayaker/fisher
• Can speak mandarin
• Exploring your data
• Understanding data types
• Visualization choices and considerations
• How data visualization can lead to data exploration
• Creating an experience with information
• New models of data visualization
• Telling a story through data
TALK OVERVIEW
PROCESS
EXECUTION
How it Works
It all starts with data
HOW IT WORKS – IT ALL STARTS WITH DATA
• Is it statistically accurate?
• What is your sample size?
• What is the margin of error?
• Are you looking at a long enough timeframe?
• When you use a sample to represent an audience, you must make
sure that the people in your sample are representative of the
audience.
• Always spot check your data when combining multiple datasets for
errors
• Examine outliers – nuggets or inaccuracies
HOW IT WORKS – IS IT ACCURATE?
•Unstructured text vs. structured text
•File format (csv, tsv, txt, json…)
•Encoding( vs. )
HOW IT WORKS – IS YOUR DATA CLEAN?
•Gaps in time
•Are you exploring all areas of where you can get data?
•Can you aggregate other data sources
• This can evolve over time
HOW IT WORKS – ARE THERE MISSING PARTS?
EXPLORING YOUR DATA
Unearthing the narrative of your visualization
• What's your hypothesis?
• Are there trends in the data?
• Start initial visualizations in excel/simple graph tools
• Are there outliers?
• Combine datasets
• Corresponding data that would be of value for comparison
• Deeper analysis
• Sentiment analysis
• NLP
• Consider viewing data with different diagrams/visualization
DATA EXPLORATION
•Not all datasets are complete
• Data cleaning will never get you to 100%
•Some information is confidential
•Consider ways to show missing data
EXPLORING YOUR DATA – WHY IS THERE
MISSING DATA?
Exploring	your	Data	– Are	you	twisting	the	
narrative?
• Hiding the greater picture
• Only focusing on a specific time can leave out
important comparison information
• Excluding data points to give greater merit to a topic
• Skewing Visualization in favour of your narrative
• Visualizing data can tell false truths when information
is not being accurately displayed
EXPLORING YOUR DATA – ARE YOU TWISTING
THE NARRATIVE?
Exploring	your	Data	– Are	you	twisting	the	
narrative?
EXPLORING YOUR DATA – ARE YOU TWISTING
THE NARRATIVE?
UNDERSTANDING DATA TYPES
• Financial
• Census/population
• Aggregated non-numeric
values
• Dates/time
• Percentage
NUMERIC VALUES
When to use:
• Bar graphs
• Scatterplots
• Line graphs
• Tables/data points
• *Pie/Donut
• BAD PIE
Not all pies are created equal
NUMERIC VALUES
• Open-ended response
• Tweets
• Address information
• Form data
• Text patterns
• *Sentiment analysis
• *Text extraction
STRING /RESPONSE TEXT
When to use:
• Word cloud
• Text snippet
• Tooltip
• Results output
NUMERIC VALUES
• Longitude and Latitude
• IP address
• Line interpolation
• Directions
• Traffic data
SPATIAL / GEOGRAPHIC
When to use:
• Choropleth maps
• Time lapse points
• Map interfaces
• Event detection
• Point to point travel
• API access to consolidated data
• Twitter
• RSS
• News feeds
• Analytics data
• Survey data
• Databases
• Customer data
GROUPED DATA SETS
When to use:
• Dashboards
• Reports
• Customer analytics
NUMERIC VALUES
Data Visualization Leads to Exploration
Iterative Development
Index all geo-located tweets referencing #UX in North America over a
few months and explore the information for insights with the UX space
Scenario
VISUALIZING DATA - CONSIDERATIONSPlot the lng/lat of the tweets
VISUALIZING DATA - CONSIDERATIONS
• Add radius weighting to follower count
• Sentiment analysis of feeling associated with tweet
• Add tooltip to show content of tweets
• Group sentiment to show distribution
Index all geo-located tweets referencing #UX in North America over a
few months and explore the information for insights with the UX space
#UXploration of Data
Recap Scenario
CREATING AN EXPERIENCE
Illustrating Insight
• Who are your audience?
• How much detail do they need?
• What is the margin of error in your data?
• Is the insight actionable or just informative?
• What story does the data tell?
• Why do you need a visualisation?
VISUALIZING DATA - CONSIDERATIONS
• Test different ways of visualizing your data
• Is this for analysisor for story telling?
• Multiple ways of seeing the same information can help reinforce
• Consider the scales and dimensions on what your visualizing
• Leverage the use of *colours
VISUALIZING DATA – HOW TO?
• Clear
• Specific
• Keep it simple
• To the point
• Inline with audience
VISUALIZING DATA – CONSIDERATIONS
• 3D – novelty vs. insight
• Visualizing data for the sake of visualizing
(number dressing)
• Over complicating the information
• Avoid graphical distortion – pick the right
scale
• Too many colours (no more than 6)
• Reduce the need for math
VISUALIZING DATA – THINGS TO AVOID
CASE STUDY
Electric Vehicles – are we making a cleaner
planet?
• Who are your audience?
• How much detail do they need?
• What is the margin of error in your data?
• Is the insight actionable or just informative?
• What story does the data tell?
• Why do you need a visualisation?
VISUALIZING DATA - CONSIDERATIONS
VISUALIZING DATA – Iterations .01
VISUALIZING DATA – Iterations .01
VISUALIZING DATA – Iterations .01
VISUALIZING DATA – Iterations .2
VISUALIZING DATA – Iterations .3
VISUALIZING DATA – Iterations .4
VISUALIZING DATA – Iterations .5
VISUALIZING DATA – Iterations .9
VISUALIZING DATA – outcome
VISUALIZING DATA – outcome
• Who are your audience?
• How much detail do they need?
• What is the margin of error in your data?
• Is the insight actionable or just informative?
• What story does the data tell?
• Why do you need a visualisation?
VISUALIZING DATA - CONSIDERATIONS
NEW MODEL OF DATA VIZ
Static to Interactive to static and so on.
VISUALIZING DATA – Static
VISUALIZING DATA – Static
VISUALIZING DATA – Sliced Static
VISUALIZING DATA – Article Integration
TELLING A STORY THROUGH DATA
VISUALIZING DATA – Static
VISUALIZING DATA – Static
VISUALIZING DATA – Static
VISUALIZING DATA – Static
DATA DESIGN DEVELOPMENT INTERSECTION
DATA
DEVELOPMENT
DESIGN
Explore Define
• Exploring your data
• Understanding data types
• Visualization choices and considerations
• How data visualization can lead to data exploration
• Creating an experience with information
• New models of data visualization
• Telling a story through data
IN CLOSING…
PROCESS
EXECUTION
THANKS

Designing Data Visualization