DAL
AUG 9, 2017
#datapopup
@datapopup
A day of fascinating talks and workshops
on data science applications in finance,
retail and technology
Breakfast + Registration
Opening Remarks
Nick Elprin
Co-Founder & CEO, Domino Data Lab
Talk by
Robert Welborn
Robert Welborn
AVP, Data Science, USAA
@robert_welborn
Talk by Eduardo
Ariño De la Rubia
Eduardo Ariño De la Rubia
Chief Data Scientist, Domino
@earino
Talk by
Danny Monistere
Danny Monistere
SVP - Client Services, Nielsen
@DannyMonistere
Lunch and Networking
The State of
Data Science in
Financial Services
Fireside Chat
Meltem Ballan
Senior Consultant-Advanced Analytics, Clarity Insights
Dave LaGassa
Financial Services Divisional Data Officer, Capital One
Fatih Akici
Sr. Risk Analyst, ACE Cash Express
Abhishek Goel
@meltball
@fatih_akici
SVP Strategic Data Solutions, Citi Bank
@IamAbhishekGoel
@CapitalOne
Solving Data Science’s First Mile Problem:
Or How I Learned to Stop Worrying and Love Linked Data
Jon Loyens
Chief Product Officer, Data World
@jonloyens
About	me	and	data.world
Jon Loyens
• Co-Founder and Chief Product Officer of data.world
• VP of Engineering for Traveler Products at HomeAway
• VP of Engineering and Director of the Labs group at Bazaarvoice.
• A/B testing and data-driven product management is my bag
• Openness, transparency and OSS is also my bag (and inspiration for data.world)
• Kubrick fan
• @jonloyens (pretty much everywhere)
for so many people, the first mile of data science is the last mile,
because people just quit when they see the data.
Prof. Eric Schwartz, University of Michigan
“
DATA	SCIENTISTS,	SMES,	ANALYSTS,	RESEARCHERS	
The first mile problem
Finding, cleaning, transforming, merging
data to prepare it for analysis
80%
1st Mile
2nd Mile 3rd Mile
Analysis App building
said building training datasets
said cleaning/organizing data
said Collecting datasets
Data scientists’ least favorite aspect of their job:
10%
57%
21%
All are first mile problems
18,000,000
open datasets
Slide title at the top
Knowledge?
How	we	think	about	data	today…
How	we	communicate	about	data	today…
Great	OSS	is…
• Well	documented
• Welcoming
• Transparent
• Has	community
• Standards
Great	DS	has…
• Context
• Documentation
• Provenance
• Community
• Transparency
• REPRODUCIBILITY
Need	a	way	of		capturing	continuous context
Integrated Social Linked
The data analysis process -- deriving insights from the
data -- is non-linear, investigative, and involves the
data scientist manipulating the data in an analysis
environment such as R or Python.
With data.world's APIs and integrations, data
scientists can seamlessly connect with data.world in
order to directly load and publish data to the platform.
INTEGRATED
SOCIAL	– BRING	PEOPLE	&	
DATA	TOGETHER
Linked data realizes the vision of evolving the
Web into a global data commons, allowing
applications to operate on top of an unbounded
set of data sources, via standardized access
mechanisms.
I expect that Linked Data will enable a
significant evolutionary step in leading the web
to it’s full potential.
“
Tim Berners-Lee
TED Talk 2009
Few formats convey MEANING
about the contents in a way that can be
SHARED and EXTENDED.
Data
Information
Knowledge
Wisdom
SAME WWW	It’s applying the architecture as the
of linked documents to… DATA
DATAFirst, break into ATOMIC	FACTS
SUBJECT,	PREDICATE,	OBJECT
( )
UNIVERSAL	IDENTIFIERS	
is	about	publishing	data	as
LINKED	DATA
ATOMIC	FACTS
and	using	
to	refer	to	concepts	and	relationships,	so	we	
can	agree	upon	the SEMANTIC	MEANING
of	data.
Data	can	enjoy	a
“NETWORK	EFFECT”
Each dataset that is added to the network
INCREASES the incrementalVALUE
of every data set in the network
So the PEOPLE and MACHINES who are using that
data to solve HUMANITY'S BIGGEST PROBLEMS
can leverage the sum of accumulated knowledge as
effectively as possible.
Slide title at the top
Leveraged Analytics at Scale
Paul Speaker
Senior Data Scientist,
The Dow Chemical Company
@DowChemical
Building Serverless Data
Pipelines in the Cloud
Manisha Sule
Director of Big Data Analytics, Linux Acedemy
@tweetDataS
Coffee Break
Talk by
Sanjeev Kumar
Sanjeev Kumar
Head of Analytics & Data Science,
Honeywell Oil & Gas
@SanjeevK_
Talk by
David Whiting
David Whiting
Head of Data Science Analysis,
Capital One
@CapitalOne
Coffee Break
Bias, Ethics, and
Social Justice in
the age of AI
Fireside Chat
Eduardo Ariño De la Rubia
Chief Data Scientist, Domino
Sudha Subramanian
Sr. Data Analyst, Sparkfish
Anna Popova
Data Scientist Senior Adviser, Dell
@sravanankarajuSravan Ankaraju
Founder, Data Scientist, Divergence Academy
@gosparkfish
@dell
@earino
@amdocsViswanath Puttagunta
Sr Data Scientist, Amdocs
Closing Remarks
Anna Anisin
Creator, Data Pop-up
@AnnaOnTheWeb
Thank you to our Sponsors:

How I Learned to Stop Worrying and Love Linked Data