Loading…
2021 UNC CAUSE has ended
Back To Schedule
Tuesday, October 5 • 1:00pm - 1:50pm
The Multi-Cloud Approach to Cost-Cutting/Effective IT Infrastructure Data Analytics

Log in to save this to your schedule, view media, leave feedback and see who's attending!

Feedback form is now closed.
This talk will present UNCG ITS Infrastructure Analytics Team approach to "cloud-first" data pipelines, extraction of actionable insight from that data, and presentation of the results in dashboards to relevant stakeholders. Pipelines are established using an on-premises Apache Airflow instance, pushing data primarily into Google BigQuery, but also into Azure Blob Storage. From there, a variety of machine-learning classification algorithms have been written for anomaly detection/outlier elimination as well as metric/status forecasting for VMs, apps, and services. This is accomplished either by executing Python scripts using TensorFlow and SciKit Learn libraries with Google Cloud Functions or PySpark scripts in AzureML within Synapse. Finally, the results are put together in Power BI dashboards for stakeholders at the University. Each path we take is considered in terms of cost and speed, so along with the how, we’ll talk about why we do what we do in our multi-cloud approach.

Note: Log into Sched to see the "Open Zoom" button. Make sure to complete these 5 steps to get access.

Speakers
ND

Nathan Dzbenski

Other, UNC Greensboro
I'm the data scientist for the IT Services Infrastructure Analytics Team at the University of North Carolina at Greensboro.
TW

Tim Watts

Integration Specialist, UNC Greensboro
I wrangle data.


Tuesday October 5, 2021 1:00pm - 1:50pm EDT
Zoom Meeting