Loading…

Sign up or log in to bookmark your favorites and sync them to your phone or calendar.

Tuesday, January 22
 

9:00am

Introduction
Speakers
avatar for Mercè Crosas

Mercè Crosas

Chief Data Science and Technology Officer, IQSS, Harvard University
Mercè Crosas is the Chief Data Science and Technology Officer at the Institute for Quantitative Social Science (IQSS) at Harvard University. She has more than 10 years of experience leading the Dataverse project and more than 15 years of experience building data management and analysis... Read More →


Tuesday January 22, 2019 9:00am - 9:15am
Room S010 Tsai Auditorium

9:15am

Big Data Use Case: Jeff Lichtman
Speakers
avatar for Jeff Lichtman

Jeff Lichtman

Jeremy R. Knowles Professor of Molecular and Cellular Biology, Harvard PhD Program in Neuroscience


Tuesday January 22, 2019 9:15am - 9:40am
Room S010 Tsai Auditorium

9:40am

Sensitive Quantitative Data Use Case: Paul Avillach
Speakers
avatar for Paul Avillach

Paul Avillach

Assistant Professor of Biomedical Informatics, Harvard Medical School


Tuesday January 22, 2019 9:40am - 10:05am
Room S010 Tsai Auditorium

10:05am

Qualitative Data Use Case (Education): Liz Dawes Duraisingh
Speakers
avatar for Liz Dawes Duraisingh

Liz Dawes Duraisingh

Principal Investigator, Project Zero, Harvard Graduate School of Education


Tuesday January 22, 2019 10:05am - 10:30am
Room S010 Tsai Auditorium

10:30am

Coffee Break
Tuesday January 22, 2019 10:30am - 10:45am
CGIS South Concourse Hallway

10:45am

Effective Data Visualization: Principles and Practices
Principles and best practises of data visualization.

Speakers
avatar for Hugh Truslow

Hugh Truslow

Harvard College Library, Head, Social Sciences and Visualization Maps, Media, Data and Government


Tuesday January 22, 2019 10:45am - 11:00am
Room S020 Belfer Case Study Room

10:45am

Data Analysis Keynote: Xiao-Li Meng
Speakers
avatar for Xiao-Li Meng

Xiao-Li Meng

Whipple V. N. Jones Professor of Statistics, Harvard University, Department of Statistics


Tuesday January 22, 2019 10:45am - 11:05am
Room S010 Tsai Auditorium

10:45am

Creating a Data Use Agreement
Introduction to creating a Data Use Agreement (DUA).

Speakers
avatar for Alisa Jahns

Alisa Jahns

Senior Business Anaylst, Harvard University, Research Administration Systems
avatar for Megan Moore

Megan Moore

Associate Director, Grants and Contracts, Harvard University, Office for Sponsored Programs


Tuesday January 22, 2019 10:45am - 11:15am
Room S050

11:00am

Data Visualization with Tableau
Learn the basics of importing data into Tableau, generating and iterating on a variety of data visualizations, and creating dashboards.

Speakers
avatar for Jess Cohen-Tanugi

Jess Cohen-Tanugi

Visualization Specialist, Harvard Library


Tuesday January 22, 2019 11:00am - 12:30pm
Room S020 Belfer Case Study Room

11:05am

Network Analysis with Gephi
Introduction to basic network analysis concepts with an existing dataset.

Speakers
avatar for Jeremy Guillette

Jeremy Guillette

Digital Scholarship Facilitator, Harvard University, Department of History


Tuesday January 22, 2019 11:05am - 12:30pm
Room S010 Tsai Auditorium

11:15am

Finding / Accessing Social Science Data
Major resources, strategies, and contacts for accessing social science data through library systems.

Speakers
avatar for Hugh Truslow

Hugh Truslow

Harvard College Library, Head, Social Sciences and Visualization Maps, Media, Data and Government


Tuesday January 22, 2019 11:15am - 11:30am
Room S050

11:30am

Geospatial Data Acquisition & Evaluation
Learn how both geospatial data and Big Data are captured and prepared for analysis. Data sources include GPS, remote sensing, census, OpenStreetMap, GeoNames and social media. I will demonstrate how QGIS can be used to geocode large data files.

Speakers
avatar for Nicole Alexander

Nicole Alexander

Visiting Researcher, Center for Geographic Analysis, IQSS


Tuesday January 22, 2019 11:30am - 12:00pm
Room S050

12:00pm

Data Management Keynote: Amy Nurnberger
Speakers
avatar for Amy Nurnberger

Amy Nurnberger

Program Head for Data Management Services, MIT


Tuesday January 22, 2019 12:00pm - 12:20pm
Room S050

12:30pm

1:30pm

Data Visualization Keynote: Alyssa Goodman
Speakers
avatar for Alyssa Goodman

Alyssa Goodman

Robert Wheeler Willson Professor of Applied Astronomy, Harvard University, Department of Astronomy
Alyssa Goodman is the  Robert Wheeler Willson Professor of Applied Astronomy at Harvard University, co-Director for Science, Radcliffe Institute for Advanced Study, and a Research Associate of the Smithsonian Institution.  Goodman's research and teaching interests span astronomy... Read More →


Tuesday January 22, 2019 1:30pm - 1:50pm
Room S020 Belfer Case Study Room

1:30pm

Science Gateways
Introduction to Science Gateways, which allow science & engineering communities to access shared data, software, computing services, instruments, educational materials, and other resources specific to their disciplines.

Speakers
avatar for Raminder Singh

Raminder Singh

Manager of Data Science & Research Facilitation, FAS Research Computing, Harvard University


Tuesday January 22, 2019 1:30pm - 2:00pm
Room S010 Tsai Auditorium

1:30pm

Reproducible Computation at Scale in R
The drake R package is a general-purpose workflow manager for data-driven tasks in R. drake rebuilds intermediate data objects when their dependencies change, and it skips work when the results are already up to date. Not every run through starts from scratch, there is native support for parallel and distributed computing, and completed workflows have tangible evidence of reproducibility. This presentation introduces drake using a simple practical example from the social sciences.

Speakers
avatar for Will Landau

Will Landau

Research Statistician, Eli Lilly and Company
Will Landau does statistics and writes software in the life sciences industry, and he is the creator and maintainer of the drake R package.


Tuesday January 22, 2019 1:30pm - 2:00pm
Room S050

1:50pm

Interactive Data Visualization with D3
Introduction to creating interactive web-based data visualizations using the D3 language.

Speakers
avatar for Jess Cohen-Tanugi

Jess Cohen-Tanugi

Visualization Specialist, Harvard Library
avatar for Cole Crawford

Cole Crawford

Humanities Research Computing Specialist, Harvard University Digital Scholarship Support Group


Tuesday January 22, 2019 1:50pm - 5:00pm
Room S020 Belfer Case Study Room

2:00pm

Web Scraping and Web APIs
Web scraping is a method of extracting and restructuring information from web pages. This workshop will introduce basic techniques for web scraping using popular Python libraries. Participants will practice accessing websites, parsing information, and storing data in a CSV file. This workshop is intended for those who are new to web scraping, but have some familiarity with Python.

Speakers
avatar for Ista Zahn

Ista Zahn

Data Science Specialist, Harvard University, IQSS


Tuesday January 22, 2019 2:00pm - 5:00pm
Room S010 Tsai Auditorium

2:15pm

Electronic Lab Notebooks Overview
Using Electronic Lab Notebooks to increase the reproducibility of analyses.

Speakers
avatar for Julie Goldman

Julie Goldman

Countway Research Data Services Librarian, Harvard Medical School


Tuesday January 22, 2019 2:15pm - 2:45pm
Room S030

2:45pm

RSpace: An ELN Example & Its Workflow
Introduction to the RSpace Electronic Lab Notebook environment.

Speakers
avatar for Rory Macneil

Rory Macneil

Founder/CEO, RSpace


Tuesday January 22, 2019 2:45pm - 3:15pm
Room S030

3:15pm

Coffee Break
Tuesday January 22, 2019 3:15pm - 3:30pm
CGIS South Concourse Hallway

3:30pm

Text Analysis with Voyant Tools
Introduction to some ideas in text analysis using Voyant Tools. Also including some info on where to go when you need more than Voyant (which you will.)

Recommended: Install Voyant Tools locally from https://github.com/sgsinclair/VoyantServer

Speakers
avatar for Jeremy Guillette

Jeremy Guillette

Digital Scholarship Facilitator, Harvard University, Department of History


Tuesday January 22, 2019 3:30pm - 5:00pm
Room S030
 
Wednesday, January 23
 

9:00am

Geospatial Data Use Case: Dan O'Brien
Speakers
avatar for Daniel T. O’Brien

Daniel T. O’Brien

Associate Professor of Public Policy and Urban Affairs and Criminology and Criminal Justice, Northeastern University


Wednesday January 23, 2019 9:00am - 9:25am
Room S010 Tsai Auditorium

9:25am

Qualitative Data Use Case (History): Jill Lepore
Speakers
avatar for Jill Lepore

Jill Lepore

David Woods Kemper '41 Professor of American History, Harvard University, History Department


Wednesday January 23, 2019 9:25am - 9:50am
Room S010 Tsai Auditorium

9:50am

Quantitative Data Use Case: Dustin Tingley
Speakers
avatar for Dustin Tingley

Dustin Tingley

Professor of Government, Harvard University, Government Department


Wednesday January 23, 2019 9:50am - 10:15am
Room S010 Tsai Auditorium

10:15am

Coffee Break
Wednesday January 23, 2019 10:15am - 10:30am
CGIS South Concourse Hallway

10:30am

Looking Towards the Computing Horizon: A Northeast Cyberinfrastructure Lab
When data takes a front seat to compute.

Speakers
avatar for Scott Yockel

Scott Yockel

Director of Research Computing, Harvard University, FAS Research Computing


Wednesday January 23, 2019 10:30am - 11:00am
Room S020 Belfer Case Study Room

10:30am

Bayesian Modeling with Stan
This workshop will briefly cover the intuition of Bayesian inference and then introduce the Stan computer language, which uses Bayesian inference to estimate the unknown parameters of a model. The Stan language is free and open-source and can be interfaced from other free and open-source languages for data analysis such as R, Python, and Julia. Workshop participants will build incrementally more realistic models for success at a task, specifically success at putting in golf, and learn how to evaluate competing models in a Bayesian fashion.

Speakers
avatar for Ben Goodrich

Ben Goodrich

Core Developer, The Stan Development Team


Wednesday January 23, 2019 10:30am - 12:30pm
Room S010 Tsai Auditorium

10:30am

Custom Data Bases for Data Management
Modern-day research requires merging data sets from multiple sources while maintaining consistency and integrity. This workshop will discuss the fundamentals of using a lightweight database like SQLite to achieve those goals, which opens up the powerful world of SQL. We will also explore examples in which various analysis tools like R, Stata, and Python, can access, interrogate, and combine this information with little effort.

Speakers
avatar for Bob Freeman

Bob Freeman

Director, Research Technology Operations, Research Computing Services, HBS
avatar for Radhika Khetani

Radhika Khetani

Research Scientist, Harvard T.H. Chan School of Public Health


Wednesday January 23, 2019 10:30am - 12:30pm
Room S050

11:00am

Geospatial Data Visualization and Sharing with WorldMap
Introduction to the WorldMap platform (http://worldmap.harvard.edu) a system for sharing and visualizing geospatial information on the web. WorldMap can be used on its own or from within Dataverse where it provides basic spatial visualization capabilities.

Speakers
avatar for Jeff Blossom

Jeff Blossom

GIS Service Manager, Center for Geographic Analysis, Harvard University
avatar for Ben Lewis

Ben Lewis

Geospatial Technology Manager, Harvard University, Center for Geographic Analysis


Wednesday January 23, 2019 11:00am - 12:30pm
Room S020 Belfer Case Study Room

12:30pm

1:30pm

Stochastic Gradient Boosting Machines
Introduction to how a set of weak learners can create a single strong learner.

Speakers
avatar for Daina Bouquin

Daina Bouquin

Head Librarian, Harvard-Smithsonian Center for Astrophysics, Harvard Faculty of Arts and Sciences


Wednesday January 23, 2019 1:30pm - 2:00pm
Room S010 Tsai Auditorium

1:30pm

De-identification of Sensitive Data
Introduction to strategies for dealing with sensitive data.

Speakers
JS

Ji Su Yoo

Research Analyst, Harvard University, IQSS


Wednesday January 23, 2019 1:30pm - 2:00pm
Room S050

1:30pm

Version Control for Data, Script, & Documents
Tired of wading through files titled paper_v1, paper_v2, paper_v3_beforeAdvisor... ? This hands-on workshop will introduce the concepts of using a version control system to manage changes and the history of your data files (text, web, scripts, etc.). Attendees will participate in multiple exercises in setting up repositories, where the changes and history are stored, working through the iterative process of altering files & recording changes, and rewinding the clock to retrieve a file at any point in the creative process.

Speakers
avatar for Bob Freeman

Bob Freeman

Director, Research Technology Operations, Research Computing Services, HBS
avatar for Radhika Khetani

Radhika Khetani

Research Scientist, Harvard T.H. Chan School of Public Health


Wednesday January 23, 2019 1:30pm - 3:15pm
Room S020 Belfer Case Study Room

2:00pm

New Data Regulations: GDPR
How the new General Data Protection Regulation (GDPR) will affect your research.

Speakers
avatar for Ara Tahmassian

Ara Tahmassian

Chief Research Compliance Officer, Harvard University, Office of the Vice Provost for Research


Wednesday January 23, 2019 2:00pm - 2:30pm
Room S050

2:00pm

Machine Learning Workflow in Python
This workshop compares the performance of multiple machine learning classifiers over a high-dimensional dataset. It explores the techniques that help to improve the performance of classifiers through comparison, including cross-validation, parameter turning, ensemble methods, and feature selection. We will use Python 3, but the conceptual workflow can be applied to any programming language (e.g., R, Julia).

Speakers

Wednesday January 23, 2019 2:00pm - 5:00pm
Room S010 Tsai Auditorium

2:30pm

Research Software Preservation & Attribution
Introduction to the preservation and attribution of research software.

Speakers
avatar for Daina Bouquin

Daina Bouquin

Head Librarian, Harvard-Smithsonian Center for Astrophysics, Harvard Faculty of Arts and Sciences


Wednesday January 23, 2019 2:30pm - 3:00pm
Room S050

3:15pm

Coffee Break
Wednesday January 23, 2019 3:15pm - 3:30pm
CGIS South Concourse Hallway

3:30pm

Big Geospatial Data Processing (GPU powered fast query)
Introduction to processing geospatial data at scale.

Speakers
avatar for Ben Lewis

Ben Lewis

Geospatial Technology Manager, Harvard University, Center for Geographic Analysis
avatar for Josh Lieberman

Josh Lieberman

Senior Research Scientist, Harvard University, Center for Geographic Analysis
avatar for Randy Zwitch

Randy Zwitch

Senior Developer Advocate, OmniSci, Inc.
Randy Zwitch is a Senior Developer Advocate at OmniSci, enabling customers and community users alike to utilize OmniSci to its fullest potential. With broad industry experience in Energy, Digital Analytics, Banking, Telecommunications and Media, Randy brings a wealth of knowledge... Read More →


Wednesday January 23, 2019 3:30pm - 5:00pm
Room S020 Belfer Case Study Room

3:30pm

Disseminating & Preserving Data (Using Dataverse)
Using Harvard Dataverse, Harvard's repository for disseminating, publishing and archiving research data at the end of the lifecycle: a demo on how to publish research data to Harvard Dataverse, which includes documenting the data, choosing the right formats, setting terms, restrictions and permissions, etc. Conecting data to apublication. How to use data citation.

Files and Setup Inforomation:
https://github.com/IQSS/datafest/tree/master/DataFest-2019/Dataverse%20Demo%20Files

Speakers
avatar for Sonia Barbosa

Sonia Barbosa

Manager of Data Curation and Acquisition for the Harvard Dataverse Project, Harvard University, IQSS


Wednesday January 23, 2019 3:30pm - 5:00pm
Room S030