SQL
Apr
2
6:30 PM

SQL

Come out to DSI's SQL workshop! Learn about basic SQL queries and their uses!

Download and install the Anaconda Python distribution for PYTHON 2.
***WE ARE USING PYTHON 2, NOT PYTHON 3***
Download from this link:
https://www.continuum.io/downloads

Download the GitHub files:
https://github.com/dsiufl/SQL_Workshops


Download SQLite Browser: 
http://sqlitebrowser.org/

Download the Sakila Database https://www.dropbox.com/s/6pz7vkl5c32xujt/sakila.db?dl=0
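
For a feel of the kind of basic queries involved, here is a minimal sketch using Python's built-in sqlite3 module. It does not open the real sakila.db; instead it assumes a tiny, made-up stand-in for Sakila's film table (the rows and the reduced column set are hypothetical), so it can be run before anything is downloaded.

```python
import sqlite3

# Hypothetical stand-in for a small slice of Sakila's `film` table.
# The real sakila.db has many more columns and rows.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE film ("
    "film_id INTEGER PRIMARY KEY, title TEXT, rating TEXT, length INTEGER)"
)
conn.executemany(
    "INSERT INTO film (title, rating, length) VALUES (?, ?, ?)",
    [
        ("EXAMPLE ONE", "PG", 86),
        ("EXAMPLE TWO", "G", 48),
        ("EXAMPLE THREE", "PG", 117),
        ("EXAMPLE FOUR", "R", 130),
    ],
)

# Basic SELECT with a WHERE filter and ORDER BY.
long_films = conn.execute(
    "SELECT title, length FROM film WHERE length > ? ORDER BY length DESC",
    (100,),
).fetchall()

# Aggregate with GROUP BY: number of films and average length per rating.
by_rating = conn.execute(
    "SELECT rating, COUNT(*), AVG(length) FROM film "
    "GROUP BY rating ORDER BY rating"
).fetchall()

print(long_films)
print(by_rating)
conn.close()
```

To try the same queries on the downloaded database, replace ":memory:" with the path to sakila.db and skip the CREATE and INSERT steps.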

Python 2 w/ AIS
Mar
28
6:30 PM

Python 2 w/ AIS

Come out to the Python 2 Workshop! This is the last workshop in the Python series! Python 2 will cover an introduction to Machine Learning concepts in Python. Machine learning is quickly becoming one of the most sought-after skill sets in the technology industry, as it allows computers to recognize patterns and make predictions without being explicitly programmed to do so.
----------------------------------------------------------------------------------------
To prepare for the upcoming workshop, please follow the steps below to download the programs and we’ll teach you the rest! (There's no need to do this if you have already attended a DSI workshop and have done it.)

Download and install the Anaconda Python distribution for PYTHON 2. WE ARE USING PYTHON 2, NOT PYTHON 3. (This includes Jupyter, a notebook application that will allow you to run IPython notebooks.)

Use this link:
https://www.continuum.io/downloads

Download the IPython notebooks and files from GitHub. Just click "clone or download" in the top right-hand corner, and select "download zip".

Use this link: 
https://github.com/dsiufl/Python-Workshops

Open up the Anaconda launcher; the Jupyter page will open in a web browser. Navigate to the location of the downloaded files. Through the Jupyter webpage, you will be able to run the IPython notebook.

The notebook that we will be using is Python II Fall 2016 - Student.ipynb
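
As a rough illustration of "making predictions without being explicitly programmed", here is a minimal nearest-neighbor sketch in plain Python. The data points and labels are made up for illustration, and the workshop notebook may well use different methods and libraries. Note that no pass/fail rule is written anywhere in the code: the prediction comes entirely from the labeled examples.

```python
import math

# Hypothetical toy data: (hours studied, hours slept) -> outcome label.
training_points = [
    ((1.0, 4.0), "fail"),
    ((2.0, 5.0), "fail"),
    ((6.0, 7.0), "pass"),
    ((8.0, 8.0), "pass"),
]


def predict(point):
    """Return the label of the training example closest to `point`."""

    def distance(example):
        features, _label = example
        # Euclidean distance between the example's features and the query.
        return math.sqrt(sum((a - b) ** 2 for a, b in zip(features, point)))

    _features, label = min(training_points, key=distance)
    return label


print(predict((7.0, 6.5)))
print(predict((1.5, 4.5)))
```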

Python 1 w/ AIS
Mar
21
6:30 PM

Python 1 w/ AIS

Python 1 is structured as an introduction to data analysis with Python. In this workshop, you will learn how to upload .csv files, clean your data, and perform some basic analytics and visualization. This workshop will introduce you to the numpy and pandas Python libraries, which are used for scientific computing and databases, respectively.

PLEASE follow the instructions listed below PRIOR to coming to the workshop:

Download and install the Anaconda Python distribution for PYTHON 2. WE ARE USING PYTHON 2, NOT PYTHON 3. (This includes Jupyter, a notebook application that will allow you to run IPython notebooks.) NOTE: You can use Python 3, but there are some syntax differences, so be familiar with these if you plan on using it.

Use this link:
https://www.continuum.io/downloads

Download the IPython notebooks and files from GitHub. Just click "clone or download" in the top right-hand corner, and select "download zip".

Use this link: 
https://github.com/dsiufl/Python-Workshops

Open up the Anaconda launcher; the Jupyter page will open in a web browser. Navigate to the location of the downloaded files. Through the Jupyter webpage, you will be able to run the IPython notebook.

We will be using the UF DSI Python 1 student - update.ipynb notebook
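
The instructions above mention syntax differences between Python 2 and Python 3. A few of the most common ones are sketched below. This is not an exhaustive list, and the workshop notebooks may run into others. The __future__ import lets the same code behave identically under both versions.

```python
from __future__ import division, print_function

# print is a function in Python 3; the import above makes Python 2 agree.
print("Hello, DSI!")

# In Python 3, / is true division and // is floor division.
# In plain Python 2 (without the __future__ import), 7 / 2 would give 3.
true_quotient = 7 / 2
floor_quotient = 7 // 2
print(true_quotient, floor_quotient)

# range() is lazy in Python 3 (in Python 2 it builds a full list),
# so wrap it in list() when an actual list is needed.
evens = list(range(0, 10, 2))
print(evens)
```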

Data Visualization
Mar
19
6:30 PM

Data Visualization

Deepen your Python skills by taking a tour of the visualization library Seaborn. Learn how to make stunning heatmaps, histograms and more.

To prepare for the upcoming workshop, please follow the steps below to download the programs and we’ll teach you the rest!

Download and install the Anaconda Python distribution for PYTHON 2. WE ARE USING PYTHON 2, NOT PYTHON 3. (This includes Jupyter, a notebook application that will allow you to run IPython notebooks.)

Use this link:
https://www.continuum.io/

Download the IPython notebooks and files from GitHub. Just click "clone or download" in the top right-hand corner, and select "download zip".

Use this link: 
https://github.com/dsiufl/DataViz

Open up the Anaconda launcher; the Jupyter page will open in a web browser. Navigate to the location of the downloaded files. Through the Jupyter webpage, you will be able to run the Jupyter notebook.

We will be using "Data Visualization - Student.ipynb" for this workshop!

Symposium
Mar
17
10:30 AM

Symposium

Come spend your Saturday at the largest DSI event of the year - our annual Symposium. 

Begin the day with coffee, remarks from DSI leadership and the UFII Director, and a keynote.

The symposium continues with speakers from a wide range of research fields at UF in three breakout sessions of four speakers each. Learn about computer vision, bioinformatics, political forecasting, business analytics, and more.

Our symposium will also include two rounds of workshops with several choices in each round, so you can brush up on your Python, learn about data visualization, or deepen your knowledge of machine learning.

This is a fantastic opportunity to network with students and faculty who are passionate about the impact of data science and the tools they utilize to realize that impact. 

Coffee and lunch will be served.

If you plan to attend, please RSVP through this form: 

https://goo.gl/forms/5jL1Ur3PtGYO2PBc2

Be one of the first 100 people to sign up by February 24th and get a free DSI shirt!

While we urge you to RSVP for food estimates, we will not turn anyone away, so feel free to bring a friend! 


The schedule is below: 

10:30 - 11:00  Registration & Coffee in Grand Ballroom
11:00 - 11:15  Bobbie Isaly - What is DSI?
11:15 - 11:30  Dr. George Michailidis - What is the UFII?
11:30 - 12:30  Keynote Speaker - Dr. Manuel Bermúdez
12:30 - 1:20  Networking Lunch
1:30 - 2:00  Breakout Session 1 (20 minute presentations, 10 minute Q&A)
2:10 - 2:40  Breakout Session 2 (20 minute presentations, 10 minute Q&A)
2:50 - 3:20  Breakout Session 3 (20 minute presentations, 10 minute Q&A)
3:30 - 4:10  Workshop Session 1 (4 workshops)
4:20 - 5:00  Workshop Session 2 (4 workshops)
5:05 - 5:10  Closing Remarks and How to Get Involved

Breakout session 1:

Room 1

Smart/Green Manufacturing: Data Enabled Decision Making and Optimization Applications

Panos M. Pardalos, ISE Department

Center for Applied Optimization, University of Florida

http://www.ise.ufl.edu/pardalos

 

Smart manufacturing (Industry 4.0) is the fourth industrial revolution. With advances in information and telecommunication technologies and data enabled decision making, smart manufacturing can be an essential component of sustainable development.

 

We are going to discuss some successes and focus on data enabled decision making and optimization applications. In addition, we will discuss future research directions and new challenges to society.

 

 

Room 2

Behavioral Finance, Data Science, and Sports: Umpires and MLB Totals Market Efficiency

Dr. Brian M Mills, Department of Tourism, Recreation and Sport Management

 

Sports betting markets have been used extensively in understanding market efficiency and behavioral biases, and have played a public role in generating interest in data science and analytics. We use this setting to test the propensity for the MLB totals market to integrate information about umpire home plate assignments, which are only known to the public for certain games. We first use generalized additive models to estimate the strike zone surface in MLB using data on individual pitch location for 2.5 million called pitches from 2008 through 2014. From these models, we aggregate error terms at the individual umpire level as a measure of favorability toward offense or defense, and insert this measure into least squares regressions to identify effects of umpire behavior on actual run totals. We then identify whether totals lines adjust upon release of information about umpire assignments to the public for certain games. Our regressions show that while the market adjusts slightly to umpire assignments, it does not adjust fully, and there are opportunities for sharp bettors to take advantage of this information. We exhibit a simple betting strategy using this granular umpire decision data that returns nearly 10% per bet.

 

Room 3
Brief Overview of Statistical Designs

Matthew Robinson, Department of Biostatistics


A brief overview of common statistical designs, sample size and power analysis, data formatting, and restricted randomization. We talk about statistical ways to compare groups and measure associations between variables while avoiding common pitfalls such as confounding. Additional related topics of sample size considerations, power analysis, data formatting, and methods of randomization will also be covered.

 

Room 4

Concept Drift Detection: the State-of-the-Art
Shujian Yu, Computational NeuroEngineering Laboratory


In a streaming environment, there is often a need for statistical prediction models to detect and adapt to concept drifts (i.e., changes in the joint distribution between predictor and response variables) so as to mitigate deteriorating predictive performance over time. Various concept drift detection approaches have been proposed in the past decades. However, they do not perform well across different data stream distributions and rely heavily on the availability of true labels. This talk presents a novel framework that can detect and also adapt to the various concept drift types, even in the scenario of expensive labels. The framework leverages a hierarchical set of hypothesis tests in an online fashion to detect concept drifts and employs an adaptive training strategy to significantly boost its adaptation capability. A Request-and-Reverify strategy is further incorporated to significantly reduce the requirement of true labels. The performance of the proposed framework is compared to benchmark approaches using both simulated and real-world datasets spanning the breadth of concept drift types. The proposed approach significantly outperforms benchmark solutions in terms of precision, delay of detection, the adaptability across different concepts as well as the number of required true labels.
 

 

Breakout session 2:

Room 1

Visualizing Student Success Using a Sankey Diagram in Tableau
Tim Young, CLAS

Assistant Director for Data Management and Analysis


Student success is often considered only through a metric like retention or the four- and six-year graduation rates of a cohort. Simple statistics like these often mask subtle changes that happen at different points along a student’s academic career. I will demonstrate the ability to explore cohorts with a Sankey diagram (a.k.a. ribbon diagram) using Tableau software. I will also demonstrate how this visualization tool can be used for other purposes, like exploring student success in course sequences.

Python 0 w/ AIS
Mar
14
6:30 PM

Python 0 w/ AIS

Welcome back to a reboot of our Python 0 workshop! If you couldn't make it last time, head over to the Informatics Institute (next to the french fries & Marston) and learn some Python! This workshop will walk you through the essentials of programming in the Python language. It is very beginner friendly, but all skill levels are encouraged to attend. Please bring a laptop. RSVP through our Facebook event.

PLEASE follow the instructions listed below PRIOR to coming to the workshop:

Download and install the Anaconda Python distribution for PYTHON 2. WE ARE USING PYTHON 2, NOT PYTHON 3. (This includes Jupyter, a notebook application that will allow you to run IPython notebooks.) NOTE: You can use Python 3, but there are some syntax differences, so be familiar with these.

Use this link:
https://www.continuum.io/downloads

Download the IPython notebooks and files from GitHub. Just click "clone or download" in the top right-hand corner, and select "download zip".

Use this link: 
https://github.com/dsiufl/Python-Workshops

Open up the Anaconda launcher; the Jupyter page will open in a web browser. Navigate to the location of the downloaded files. Through the Jupyter webpage, you will be able to run the IPython notebook.

The notebook that we will be using is UF DSI Python 0 - 10_26_16 - Student.ipynb

R2: Sports Data Workshop
Feb
28
6:30 PM

R2: Sports Data Workshop

Come out to DSI's last R Workshop in the series! Expand your knowledge of R and its applications. 

This workshop will focus heavily on bringing all the information from the first two workshops together so that users feel confident taking on independent projects and research challenges. Specifically, it will cover finding and downloading libraries, visualization, and approaches to statistical modeling and machine learning. 

It will be more advanced than the other two workshops. If you want the lessons for the first two workshops, contact Tyler Richards at tylerjrichards@gmail.com.

R1 Workshop
Feb
21
6:30 PM

R1 Workshop

Welcome to the second part of our three-part R workshop series! In this R workshop, we will finish up some topics left over from R0 and take a dive into the tidyverse! The tidyverse is a coherent system of packages for data manipulation, exploration and visualization that share a common design philosophy, actively making working with data easier. 

Please make sure to have R installed and working prior to the meeting, and bring your laptop. 

To download RStudio:

First download R: https://cran.cnr.berkeley.edu/ 

For Windows: Open up the link for Windows and select ‘install R for the first time’

For Mac: Open up the link for Mac OS X and select ‘R-3.2.4.pkg’

Once you have downloaded R, download RStudio:
https://www.rstudio.com/products/rstudio/download/

Select your operating system and complete the download. 

Run RStudio and make sure you can run something like ‘1+1’

R0 Workshop
Feb
13
6:30 PM

R0 Workshop

This introduction to R workshop is aimed at those who are interested in R and may not have any experience using the language. 

We will be covering basic functions related to data structures and data types. We will also work through importing a data set and completing some basic manipulations.

Before you arrive, please download R and RStudio!

To download RStudio:

First download R: https://cran.cnr.berkeley.edu/ 

For Windows: Open up the link for Windows and select ‘install R for the first time’

For Mac: Open up the link for Mac OS X and select ‘R-3.4.1.pkg’

Once you have downloaded R, download RStudio:
https://www.rstudio.com/products/rstudio/download/

Select your operating system and complete the download. 

Run RStudio and make sure you can run something like ‘1+1’

If you run into trouble downloading RStudio, come to the meeting early and we can help you.

Python 2
Feb
7
6:30 PM

Python 2

Come out to the Python 2 Workshop! This is the last workshop in the Python series! Python 2 will cover an introduction to Machine Learning concepts in Python. Machine learning is quickly becoming one of the most sought-after skill sets in the technology industry, as it allows computers to recognize patterns and make predictions without being explicitly programmed to do so.
----------------------------------------------------------------------------------------
To prepare for the upcoming workshop, please follow the steps below to download the programs and we’ll teach you the rest! (There's no need to do this if you have already attended a DSI workshop and have done it.)

Download and install the Anaconda Python distribution for PYTHON 2. WE ARE USING PYTHON 2, NOT PYTHON 3. (This includes Jupyter, a notebook application that will allow you to run IPython notebooks.)

Use this link:
https://www.continuum.io/downloads

Download the IPython notebooks and files from GitHub. Just click "clone or download" in the top right-hand corner, and select "download zip".

Use this link: 
https://github.com/dsiufl/Python-Workshops

Open up the Anaconda launcher; the Jupyter page will open in a web browser. Navigate to the location of the downloaded files. Through the Jupyter webpage, you will be able to run the IPython notebook.

The notebook that we will be using is Python II Fall 2016 - Student.ipynb.

Python 1
Jan
31
6:30 PM

Python 1

Python 1 is structured as an introduction to data analysis with Python. In this workshop, you will learn how to upload .csv files, clean your data, and perform some basic analytics and visualization. This workshop will introduce you to the numpy and pandas Python libraries, which are used for scientific computing and databases, respectively.

PLEASE follow the instructions listed below PRIOR to coming to the workshop:

Download and install the Anaconda Python distribution for PYTHON 2. WE ARE USING PYTHON 2, NOT PYTHON 3. (This includes Jupyter, a notebook application that will allow you to run IPython notebooks.) NOTE: You can use Python 3, but there are some syntax differences, so be familiar with these if you plan on using it.

Use this link:
https://www.continuum.io/downloads

Download the IPython notebooks and files from GitHub. Just click "clone or download" in the top right-hand corner, and select "download zip".

Use this link: 
https://github.com/dsiufl/Python-Workshops

Open up the Anaconda launcher; the Jupyter page will open in a web browser. Navigate to the location of the downloaded files. Through the Jupyter webpage, you will be able to run the IPython notebook.

We will be using the DataSciUF Python I.ipynb notebook.

Software Carpentry Workshop
Jan
23
9:00 AM

Software Carpentry Workshop

The UF Carpentries Club, Informatics Institute, Biodiversity Institute, and Marston Science Library are teaming up to offer a two-day Software Carpentry Workshop, Monday-Tuesday, January 22-23, 2018, to help the UF community get started learning R and common software development tools such as the Unix shell, SQL, and Git. Need to learn some programming for a class? Have data you’re scared to work with in Excel? Want to collaborate effectively with your peers? We were there once too. Come start learning with us.


The two-day workshop will be held on Monday and Tuesday, January 22-23, 2018, at the offices of the Informatics Institute and Biodiversity Institute in the CSE Building.


For the schedule and more details about the workshop, visit https://uf-carpentry.github.io/2018-01-22-UFII/.


For more details and to register, visit https://informatics.institute.ufl.edu/event/software-carpentry-spring-2018-workshop/


For further information, contact Matthew Collins, Technical Operations Manager, Advanced Computing and Information Systems Lab, ECE, at mcollins@acis.ufl.edu.


January 22-23, 2018
9:00 AM to 4:30 PM each day
UF Informatics Institute & Biodiversity Institute
Room E251, CSE Building
432 Newell Drive
Gainesville, FL 32611

Seating is limited. Register now!

The Spiritual Dimension to Computing: DSI General Body Meeting #1
Jan
18
6:30 PM

The Spiritual Dimension to Computing: DSI General Body Meeting #1

Welcome back from break! Start your semester off right by coming to DSI's first general body meeting of the spring semester. Learn about our upcoming events and opportunities. We will have food, and Mr. Tony Barr will speak about the spiritual dimension to computing. More information on the talk is below.

The Spiritual Dimension to Computing: A Perspective from the Creator of SAS

There are dramatic changes in our world that the internet and software are driving. Through paradoxes presented in literature and personal encounters with people seeking a higher truth, patterns are emerging. My quest for understanding has compelled me to work on A Model Of Reality, a new paradigm of computing. 

Reference: 

Anthony James Barr - Wikipedia
