Google: Helping Data Scientists Organize their Files

Role: Lead Designer & Researcher

Team: Product Manager, 2 Engineers

Launched February 2024


Final Designs

The new page I designed provides a file management system to help data scientists organize, search, and declutter their workspace through three main tools:

What I did

I designed a centralized file management system (like a Google Drive) to help data scientists organize their work. Along the way, I:

Owned the end-to-end design process • Partnered w/ Product, Research, Engineering, & DevRel • Tested designs w/ users • Iterated off feedback • Advocated for user needs • Adapted to new technical constraints • Supported engineers in implementation

1) A centralized search & filter system for locating files

“Bulk delete will save me so much time.”

“Less work for me.”

“Finally! I really needed folders.”

Discovery


Using insights from past user research, I created 3 personas. I worked with the PM to prioritize user needs.

Early Collaboration

To understand the problem space, I partnered with the following roles:

  • Senior Designer to review the history of this project

  • Research Lead to review existing research + plan additional research

  • Product Manager to discuss business goals and business constraints 

  • Developer Relations Engineer to understand the technical context


Defining User Needs

The Product

Kaggle is a Google-owned machine learning community with over 10 million users. Data scientists create and publish their work to Kaggle’s vast library of user-generated content.

Solution & User Feedback

Design Phase 1:

Early Exploration


Initial Ideation

User Flow Mapping

Explored multiple ways to execute each CUJ

We wanted user feedback to inform my design direction going forward. I prepared four paper prototypes with varying features and information architecture to test on users.

  • Option A introduced folders

  • Option B organized content by type

  • Option C incorporated analytics

  • Option D auto-organized content by project

Applying feedback & solving design challenges

Feedback from concept testing provided me clear direction on which features users needed the most. Now, it was time to share my insights with the project team, align on new design priorities, and dive into solving complex UI and interaction challenges.

The Problem

Creators on Kaggle have so many loose files that they can’t locate or manage their work efficiently. This slows down their workflow and makes creating content on Kaggle harder.  

“Can’t manage my files effectively” has been a top user issue for the past 3 years.

Folders Enabled them to organize & quickly find their work

Bulk Delete Facilitated file management for multiple CUJs

Auto-populated Projects Facilitated organization & simplified project navigation

Users didn’t need:

Analytics Weren’t useful to most

  • Bulk Delete was a high user-value feature at risk of getting cut due to engineering complexity.  

  • I met with the back-end engineer to pinpoint the issue.

  • I learned that the complexity arose from one step of the flow that was not crucial to the experience. To save the feature, I redesigned a simpler experience that was easier to implement.

Result: Users can now efficiently declutter their workspace with new multi-select tool + bulk delete.

My centralized search design required all of a user’s content (datasets, models, notebooks, and discussion posts) to be listed in one place. The tricky part: each content type had distinct formatting, metadata & interactions. I needed to standardize their formatting so that they looked uniform when listed together.

Users were excited at the prospect of selecting multiple files to delete at once. Our current design system, however, had no multi-select capabilities, and there were many deletion complexities to account for in my design.

Task: Enable users to quickly and safely delete items in bulk. The solution should:

  • Enable users to select multiple items at once

  • Warn of consequences of deleting items; give specialized warnings for each content type

  • Prompt users to review their selection before deleting

Task: Create one standard way to display 4 different types of content: datasets, notebooks, models, and discussion posts.

  • I started by auditing the metadata & format of each content type

  • I explored condensing content into 1, 2, or 3 lines

  • Ultimately, I chose the 3-line version in order to maintain consistency and include all necessary metadata

Result: With a standardized 3-line format, all content types can now be listed together, allowing users to efficiently view and search all of their work from one place.

Main page

Creating prototypes for testing

Concept Testing

The Goal

Enable content creators to efficiently organize, manage, and find their work from one centralized hub.

3) A Bulk Delete tool for efficient file management

User Feedback

2) Auto-generated project folders for assisted organization

Paper Wireframes

Explored structure & feature approaches

User Impact

With this new system, users can now:

  • Jump back into ongoing projects

  • Save time finding the right file

  • Organize their work into folders

  • Declutter their workspace

Design System Challenge: Standardization



Users loved:

Reprioritizing based on user feedback

After testing, I presented my user insights to the team and we aligned on the following prioritization of features:

P0: Manual folders for custom organization

P0: Bulk Delete for quick file management

P1: Automated Project Folders for assisted organization & project navigation

Removed from scope: Analytics

Interaction Challenge: Bulk Delete

🚩 Unforseen Constraints


As we approached handoff, we got a back-end engineer assigned to the project. He reviewed the design and informed us that the design required more back-end engineering resources than were available, and that we would likely need to cut some features to meet our launch timeline.

As the user advocate, I needed to determine which features we could cut and still give users a delightful and helpful experience. To make these decisions, I returned to my design goal statement, the PRD, and the strongest user feedback themes. Then, I worked closely with the engineers to understand implementation obstacles and find solutions that met user goals.

Adapting to New Technical Constraints

Reducing Engineering Complexity


Dark mode

Models tab

Projects tab

Next Steps

  • Metric Tracking: User Surveys - We expect to see a decrease in complaints about file management struggles in our yearly survey.

  • Metric Tracking: Content Creation Rates - One business goal of this project is to facilitate more content creation and sharing on Kaggle. We hope to see an increase in engagement and publishing numbers.

  • Future Opportunity: Analytics dashboard Although lightweight analytics weren’t useful to some users, others wanted in-depth performance analytics. Next up, I could partner with Research to explore user interest in an analytics dashboard.

  • Future Opportunity: Collaborative Project Folders During testing, users loved the idea of sharing collaborative project folders; however, that feature was cut from scope due to authorization complexities. Next up, I can partner with Engineering and Product to explore simpler ways to enable advanced project collaboration.

Final Designs & Next Steps

Lessons Learned

  • Include engineers throughout the design process to avoid unforeseen constraints and redesigns. Had I reviewed my designs with a back-end engineer earlier, I would have avoided redesigning an important CUJ at the last minute.

  • Test early whenever possible. Early user feedback helped me refocus the design on what users actually needed

  • Define the MVP early with cross-functional partners to avoid scope creep.

We did not have the resources to develop all of our proposed features. To determine what to prioritize in my designs, we needed to learn which proposed features would help users the most. Specifically, I needed feedback on the utility of four aspects of my design: metadata, folders, bulk actions, and performance analytics. To get answers, I traveled to an international Kaggle conference in Barcelona, where I independently conducted research with 15 Kaggle power users.

Defined research goals w/ PM • Prototyped 4 different designs • Developed research plan • Wrote question script •. Coordinated recruitment and logistics

How I Prepared

Design Phase 2:

Collaborating to save an MVP-Critical Feature

Launched February 2024

Digital Wireframes

Presented to design team for early feedback on scalability, UX consistency, component needs, etc.

Next
Next

Google: Making Machine Learning Competitions More Approachable