Data Science Melbourne presents
Datathon 2016
21st April - 6th May 2016
Sign Up
Get Involved
Come and Participate
Novice to Experts
$5,000 Cash Prizes

And the winners are...

Thanks to everyone who took part - the standard was amazing.
Don’t forget you can still apply for the internship prizes here.

See you next year!

Pitching Comp…

1st - Dirty Dataing
Nick Barrington, Jasper Little, Kirill Kokorin

2nd - Tour No.1
Daniel Ma, Jon Blank, Jeff Lai, Yun Zhou

3rd - 4 Quarters
Sylvia Rodriguez, Daniel de Sousa, John Makau, Duong Nhu

Kaggle Comp…

1st - Smileyface
Noah Xiao, Ivan Liu

2nd - Simply Analytics
Dylan Matthews, Grant McKinnon

3rd - Tour No.1
Daniel Ma, Jon Blank, Jeff Lai, Yun Zhou

Latest News

The run in

7 hours to go

Please email your submissions to joost@datasciencemelbourne.com

Internships

The process for applying for the Internship positions is now set up. We have at least 6 places up for grabs, but you must be part of a team that submits a solution in order to apply. The application process will be open for a week after the pitch night. To view the jobs and apply, click here.

Important notice: New Data

You should have received an email today (Wed 27th) with a download link to an updated data set. If you did not receive the email or cannot find it, let us know here.

Monday 25th April (ANZAC Day)

The top 5 presentations from the 2015 datathon have now been uploaded

Grant McKinnon is setting the pace in the Kaggle comp

Only 15 tickets left for the conference

23rd April - Hackday

Ready to Rumble?

Slides and code snippets are on the meetup site under the menu more >> files.

See here

Friday 22nd

Registered, signed the NDA and not yet received the sneak peek file? Please tell us here.

The final schedule for the hackday is just below this section, here.

Software that might be useful to install before tomorrow:

  1. R and RStudio (free)
  2. Python (free)
  3. SQL Server Express (free)
  4. Tableau Desktop (free evaluation period will last 14 days)

The number of internships on offer has increased to 6: Zendesk, SAS, KPMG, iSelect (2) and Immersive. If you want to offer one, then let us know.

Thursday 21st April

The Datathon is now go!

It was standing room only at the kick-off, so we had to run 2 sessions. Apologies to those who missed out on the presentation.

Joost will endeavour to process all the NDAs tonight and get the sneak peek file emailed out ASAP.

Kaggle comp is now live. Sign up to Kaggle to enter. You will have to wait until Saturday when you get the full data to make a meaningful submission, but you could see if you can beat Sali Mali with a guess.

Any questions can be asked in the forum

Hack Day Schedule

Saturday 23rd April - Telstra, 242 Exhibition Street
Morning
09:00 Arrival
Welcome to the Melbourne Datathon hackday! If you have not yet signed an NDA, please do so upon signing in. If you are looking for a team, grab a name sticker and follow the instructions. After signing in, make your way to the data station to load up the full dataset.
10:00-10:30 Forming Teams
Attend this event if you are looking for a team. We will have muffins and instructions waiting for you.
11:00-11:30 Data walkthrough, Q&A
In this presentation, our data sponsor will give a quick overview of the data and answer any questions you may have.
Presentation area full? Don’t worry, we’ll repeat this presentation at 12:00.
12:00-12:30 Data walkthrough, Q&A
This presentation is a repeat of the 11:00 walkthrough and Q&A, for those of you that missed out on a seat.
12:30 Lunch
A pizza lunch will be served in the kitchen area.
Afternoon
Optional tutorials in the presentation area for those who want to join us.
1:15-1:45 In this tutorial we will show how to load the data into an SQL Server database, connect R to the database and do some initial exploration with Tableau.
Phil Brierley
1:45-2:00 Internship Prize Details
Yuval Marom
2:00-2:30 This tutorial is for anyone looking to learn how to get started with their analysis in Python. In particular, our mentor will walk you through some text mining and point you to the right resources to learn more.
Alistair Walsh
2:30-3:00 This tutorial is for anyone looking to learn how to get started with their analysis in R. In particular, our mentor will walk you through some text mining and point you to the right resources to learn more.
Yuval Marom
3:00-3:30 First predictive model. Here we will build our first simple model in R and submit it to Kaggle.
Phil Brierley
3:30-4:00 Would you like to learn how to effectively pitch your data science results to management? Attend this tutorial to learn all about communicating your data insights.
Mark Alexander
6:00pm End of our time at Gurrowa. I'm sure we can find a local pub to continue.

The DSM 2015 Datathon was a huge success. Thank you to all 150 participants and the 21 teams that submitted entries.
Please join us for this year’s event in April 2016, which is part of MeDaScIn 2016.

Why

  • To learn from each other and cross pollinate skill sets
  • To provide a stage for potential employers and employees to meet
  • To create a buzz in Melbourne around Data Science and reverse the brain drain
  • To have fun!

How it works

  • Read this web page and sign up at the bottom.
  • Attend the launch event on April 21st to sign the Non-Disclosure Agreement (NDA) and get a “sneak peek” file to start your analysis. Attending the launch event is optional, but will give you an early start.
  • Attend the hack day on April 23rd to get the full dataset, to form teams, to work on your analysis and to ask questions. Attending the hack day is mandatory to participate, as this will be the only day that we will be handing out the full dataset.
  • Continue working with your team and submit a slide deck before 23:59, Wednesday, May 4th.
  • Five top teams will be pre-selected to pitch their findings on the night of May 6th.
  • Prizes will be awarded at the subsequent award ceremony.
  • Presentations:
    • 1st. $1,500
    • 2nd. $1,000
    • 3rd. $500
  • Predictions: (Kaggle Competition)
    • 1st. $1,000
    • 2nd. $500
    • 3rd. $250
    • 4th. $150
    • 5th. $100
  • Internships will also be up for grabs to entrants of the Datathon (iSelect, KPMG, Immersive)
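
As a quick check, the prize amounts listed above add up to the $5,000 advertised at the top of the page. A minimal Python sketch of that sum follows; the variable names are illustrative only and are not part of the competition materials.

```python
# Prize amounts in dollars, copied from the list above.
# Variable names are illustrative, not part of any official material.
presentation_prizes = [1500, 1000, 500]           # 1st to 3rd, pitching
prediction_prizes = [1000, 500, 250, 150, 100]    # 1st to 5th, Kaggle

presentation_total = sum(presentation_prizes)
prediction_total = sum(prediction_prizes)
prize_pool = presentation_total + prediction_total

print(f"Presentations: ${presentation_total:,}")
print(f"Predictions:   ${prediction_total:,}")
print(f"Total:         ${prize_pool:,}")
```

Running it should report $3,000 for presentations, $2,000 for predictions and $5,000 in total.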

FAQ

  • Do I need to be a data science rock star to enter?
    No, this is all about learning and knowledge transfer. Even if you’ve never done anything like this before, please come along. We offer tutorials and mentors on hack day to get you started.
  • What do I need to bring?
    You will need your laptop with your favourite tools installed. Bring lots of curiosity and energy. Don’t forget your power cord!
  • Do I need to already have a team?
    No, we are expecting most people will form teams on the hack day. The organisers will be around to facilitate this with a special event. Don’t worry if you don’t know anyone; lots of people won’t.
  • Can I enter as an individual?
    Yes, but the judging panel will favour teamwork. Each participant can only be part of one submission; you cannot be both on a team and an individual.
  • Why do I need to sign a Non-Disclosure Agreement?
    This is real data from a real ‘client.’ It is a condition of them releasing it to you that an NDA is first signed. It basically means you will not use the data for any other purpose, and that you will delete it at the end of the contest.
  • What if I need help?
    There will be a handful of very experienced ‘mentors’ floating around the room on hack day. They are there to give ‘training’ on tools and techniques to munge the data - please use them! We will also host a selection of tutorials.
  • What will be revealed about the data?
    Not much - it is your job to figure things out. On hack day, the data owners will be there to give a short presentation and answer any questions you have.
  • How ‘big’ is the data?
    We don’t know yet, but we will ensure it is manageable for a laptop. It is split into several text files of bite-sized chunks.
  • Are there set tasks?
    No, we provide very little initial guidance. As a true “data explorer,” you will have to come up with your own questions for the data. We want the datathon to be just like a real data science consulting task. Ask yourself what the data provider might want to learn, and how you might go about presenting that.
  • How will it be judged?
    The main focus of our panel will be on the team’s ability to translate their findings into meaningful, easily understandable, actionable and valuable insights. They have a hypothetical budget to allocate and you need to convince them it’s worth spending it on your analytics.
  • Is this like a Kaggle competition?
    There is a predictive component with separate prizes that will be run on Kaggle. This will run over the same time period, with the winning team being revealed live at the presentation evening on the 6th May. You can enter as an individual or in a team, but one member of the team must be at the presentation evening to be eligible for the prize.
  • How do we communicate and stay up to date?
    To ask questions, use the forum here. We are also on social media:

Facebook Twitter

WHEN

Day 1
21 Apr 2016

Evening Launch

Come along after work to sign the non-disclosure agreement and get your hands on the “sneak peek” dataset. Attending the launch event is not mandatory, but will give you an early start.
Day 2
23 Apr 2016

Saturday - Hack Day

On Saturday, we will provide everything you need to work on your data investigation: food, drinks, a co-working space, wifi - and, of course, the full dataset. If you are looking to join a team, this is a great opportunity to ask around and/or attend our special team formation event. During the day, the data provider will be there for a data walkthrough and Q&A. We also host a series of (optional) ‘master classes’ to demonstrate tools, techniques and skills. Attending the hack day is your only chance to get your hands on the full dataset, so don’t miss it!
Day 3
06 May 2016

Learn from the experts

During the day on Friday, we are hosting a separate data science conference. Leading industry experts will present their experiences managing and running analytics. You can register separately for this event on our conference website: http://www.datasciencemelbourne.com/medascin/.

Pitch Time

On the final night, we will decide which team takes home the honour of Melbourne Datathon champions! Five pre-selected teams will give their pitches before our professional panel.

Award Ceremony

Once the teams have presented to our panel, we’ll retire to the bar for a drink and then award the prizes. Please collect your wristband for free drinks prior to leaving the Arena.

WHERE

Click on the links below to see the venue locations

Collective Campus - level 1, 20 Queens St

Gurrowa, Telstra, Level 2, 242 Exhibition Street, Melbourne

nab Arena - 700 Bourke St

Platform28 - 82 Village St, Docklands

The Panel

This is our board of directors who you need to sell your story to!

Anthony Alessi

Team Leader, Analytics Operations at SEEK

Ben Pattison

Head of Predictive Analytics, Marketing at NAB

Kendra Vant

GM Big Data Analytics at Telstra

Ross Farrelly

Chief Data Scientist at Teradata ANZ

Mentors

There will be a few experienced people floating around and available to help you out with technical things. Please use them; it’s a good opportunity to get a one-on-one tutorial.

If anyone else wants to help, just turn up on the hack day.

Sign Up!

Registrations are now closed.

Thanks

All this would not be possible without the supporters and friends of Data Science Melbourne

Collective Campus, KPMG, La Trobe University, Melbourne Business School, Monash University, NAB, Northraine, Pulse Data Science, RMIT, Rubix Consulting, SAS, Servian, Sportsbet, Telstra, Teradata, Zendesk, Zuse Digital

The datathon is part of the Melbourne Data Science Initiative, MeDaScIn 2016

The datathon is also part of Melbourne Knowledge Week, 2nd – 8th May 2016, proudly presented by the City of Melbourne.