Taming the beast: Automated testing for complex data pipelines

When you are faced with a system that processes massive datasets through complex data pipelines involving machine learning, how do you test effectively? When your test results are less “pass” and “fail”, and more “sort of” and “not really”, how do you automate testing?

Trish Khoo draws upon her experience in testing complex data systems to demonstrate proven strategies for testing in this field. Her experience working on ultra-large-scale systems at Google in Mountain View, California, shaped her technical approach to testing, which she applies in her work as a consultant today.

Trish Khoo

Trish Khoo is a software development consultant and international keynote speaker. She has over 20 years of experience in the software industry, specialising in software testing, infrastructure and automation. Her journey has taken her from Microsoft to Google, from London to San Francisco, and many places in between. Now she helps companies all over the world with their software needs from her home base of Brisbane, Australia. She also dedicates time towards fostering a strong local tech startup community and mentoring other technologists. When she’s not doing this, she’s working on her creative pursuits – artwork, singing and writing. Learn more about Trish at her website http://trishkhoo.com.