About
Karol is a highly experienced senior data scientist with a strong focus on NLP and wide AI applications. He has a unique academic background in physics and large-scale models, in addition to relevant experience in the customer-facing data science industry. He enjoys working with data and leading and implementing R&D projects. With his PhD in physics and the recent MBA degree, Karol combines easy technology and business.
Experience
About 12.4 yrs of professional experience, estimated from the roles below (overlaps counted once).
- Jan 2023Present
Co-founder
Fractile
Designed and co-implemented a platform to create, customize, and apply AI agents. The agents disposed of an additional layer of security, thanks to the anonymization service that protects 100% of the data from the client's server. Applied AI agents with the ability to learn on previous tasks and be spawned via Chat, API, or Jira. Deployed the solution to two medium-sized companies.
- Jan 2021 – Jan 2023
Warsaw Stock Exchange
Led the development process of a scalable exchange platform for personalized ads on Polish TV. Designed logo placement detection AI in live-streamed TV broadcasts. Led the development team of nine, implementing a supply-side platform, end-to-end, from conceptualization and MVP to a scalable production stage. Created an end-to-end pipeline to train the behavioral models based on the data from TV.
- Jan 2021 – Jan 2023
Lead Data Scientist
Sweetgreen Inc - Main
Designed and led the implementation of the salad recommendation engine at the production level. Created a BI tool with live-updated sales data and ML forecasting for the CxOs. Built a PoC sales forecasting model based on historical sales and weather forecast data. Improved the legacy ML production models for supply chain forecasting. Managed to improve the processing time by one order of magnitude.
- Jan 2020 – Jan 2021
Senior Data Scientist
Meloncast
Created a complete training and deploying pipeline for NLP models (BERT) to classify target audience marketing texts. Trained ML models recognizing most similar pictures in terms of content and coloristic that the client provided. Designed and deployed a production-level API for containerized Docker services.
- Jan 2019 – Jan 2021
Lead Data Scientist
Physica Solutions
Built an NLP ecosystem for using ChatGPT on the company's private data. Created subMIND, a tool for extracting subconscious information from a large body of text that uses state-of-the-art techniques for entity recognition, graph relations, and visualizations. Built Microsoft Power BI reports for a private Polish university, working directly with the business. Designed an architecture for classifying fake news in social media for the most prominent Polish university, including NLP (BERT) classification, data collection, and overall flow.
- Jan 2019 – Jan 2021
Lead Data Scientist
Yieldbird
Optimized pricing models for online ad auctions using ML tools. Created an entire ML pipeline, including data ingestion, testing, prototyping, error handling, monitoring, and evaluation. Directed the process of product development from the R&D side, including hypothesis testing and handling client feedback
- Jan 2018 – Jan 2019
Data Scientist
DS Stream
Created Tableau reports identifying fraudulent behavior of employees. Built a fully automated quality assurance system for data ingestion. Designed a Twitter fake news detector front end for data visualization.
- Jan 2016 – Jan 2017
Postdoctoral Researcher
Lawrence Berkeley National Lab
Carried out state-of-the-art research using molecular dynamics and Monte Carlo simulations on nanoscopic materials. Published three technical papers in a highly respected scientific journal. Created, simulated, and interpreted numerical simulations with over 10^7 degrees of freedom.
- Jan 2012 – Jan 2015
Doctoral Researcher
ETH Zurich
Carried out numerical simulations that resulted in models further used by other team members. Published nine technical papers in top-ranked journals as the first author. Contributed to the physical chemistry field by explaining the water adsorption-related phenomena in cellulose.
- Jan 2011 – Jan 2011
Intern
Texas A&M University
Created a numerical model of the secondary loop of the BWR nuclear reactor under the direction of Professor J. Ragusa. Applied the Monte Carlo method for sensitivity analysis of numerical coefficients in different equation functions of the state. Expanded the lab's Python library for carrying out finite element method simulations.