Suggest Edits
Global marketing investment is projected to grow 8% to over $1 trillion in 2024. As companies see their costs per acquisition go up, many turn to Kumo to help drive efficient growth through machine learning.
Kumo’s predictive query language makes it easy to build a wide variety of ML models for both B2B and B2C growth teams, including churn, lifetime value (LTV), lead ranking, conversion propensity (CTR), personalized call to actions, user segmentation, and item recommendations. And by using graph neural nets (GNNs) and large language models (LLM) with complex relational data, Kumo delivers superior predictive accuracy when compared with traditional customer relationship management (CRM), account based marketing (ABM), and personalization solutions.
Kumo’s integrations make it easy to plug into existing tools, enabling teams to deliver more models in less time. Additionally, data warehouse native deployments minimize the need for time-consuming security reviews, by performing data processing within your Snowflake or Databricks accounts.
Kumo’s predictive query language (PQL) provides the flexibility to build a wide variety of growth and marketing models. PQL statements enable data scientists declare the entity, target, filters, and optimization goal of a predictive task, in a purely declarative manner. In practice, this enables teams to quickly build GNN-powered recommendations for dozens of predictive tasks across the customer acquisition funnel.
Here are a few of the solutions that Kumo supports.
Kumo does not require data to be transformed to fit a prescriptive schema, nor does it require the installation of a tracking pixel.
Instead, Kumo makes predictions directly from the raw data that already exists in the data warehouse. The Kumo graph builder makes it easy to stitch together data from many different sources. Just connect the tables to the graph and go.
For the best results on growth and marketing use cases, Kumo encourages using data such as:
Since Kumo can stitch together signal across multiple different data sources, Kumo can achieve significantly better predictive accuracy than traditional alternatives.
Kumo reads and writes data directly to the client’s data lakehouse, supporting cloud-first data science workflows. Supported lakehouses include Snowflake, Databricks, AWS S3, GCP BigQuery, and Azure Synapse (coming soon). For example, users have found success using Kumo as part of a DBT-based development environment, using Airflow for orchestration, and Streamlit for consumption.
Additionally, Kumo provides data warehouse native deployment options, which keeps your data secure by performing data processing within your Snowflake or Databricks account. This makes Kumo suitable for use in highly regulated environments, including banking, healthcare, and government.
Kumo uses a distributed GNN training system, written in C++, which can handle multi-terabyte datasets with tens of billions of rows, and has customers that make daily recommendations for more than 100M active users or more than 10M inventory items.
Because GNNs are great at discovering complex patterns in sparse data, Kumo is also a good fit for small datasets containing 1000’s of users, and only 10’s of items.
In order to cover both in-product and out-of-product growth use cases, Kumo supports a variety of serving methods:
Kumo recommendations are powered by a GNN architecture, inspired by several academic papers in recent history. Data scientists can benefit from these advances in model architecture, without needing to code them up manually.
Here is some of the research that is used by Kumo AI:
Kumo also uses a powerful data encoding stack to convert multi-modal data into representations for deep learning.
The Kumo model planner empowers data scientists to quickly iterate and apply their domain knowledge to the model.
Specifically, the Kumo model planner gives control over:
For example, in the world of e-commerce, it is very common for user behavior to differ between the “December holiday season” and the rest of the year. If you want to make sure your churn prediction model can generalize through the year, tune the split parameter of the model planner, to pick appropriate time frames for train, validation, and test.
Data Scientists have achieved 73% performance lift on a “next best action” model, using the fine-grained controls offered in the model planner.
PQL is a declarative syntax for defining machine learning problems. It is highly flexible and easy to learn, supporting inline filters, boolean expressions, and aggregation functions.
Data scientists can quickly experiment with many different and complex predictive formulations of a machine learning problem in very few lines of code.
For example, the following “lead ranking” query predicts whether a each active lead will have a conversion in the next N days, assuming that a sales person reaches out to them tomorrow.
As part of the training process, Kumo automatically computes data visualizations and metrics to help understand the model’s strengths and weaknesses.
In order to support ongoing validation of model correctness, Kumo has the following features related to MLOps:
Because Kumo writes directly to the data lakehouse, it is easy to connect with other cloud software commonly used in growth, marketing, or sales. Here are just a few examples:
Suggest Edits
Global marketing investment is projected to grow 8% to over $1 trillion in 2024. As companies see their costs per acquisition go up, many turn to Kumo to help drive efficient growth through machine learning.
Kumo’s predictive query language makes it easy to build a wide variety of ML models for both B2B and B2C growth teams, including churn, lifetime value (LTV), lead ranking, conversion propensity (CTR), personalized call to actions, user segmentation, and item recommendations. And by using graph neural nets (GNNs) and large language models (LLM) with complex relational data, Kumo delivers superior predictive accuracy when compared with traditional customer relationship management (CRM), account based marketing (ABM), and personalization solutions.
Kumo’s integrations make it easy to plug into existing tools, enabling teams to deliver more models in less time. Additionally, data warehouse native deployments minimize the need for time-consuming security reviews, by performing data processing within your Snowflake or Databricks accounts.
Kumo’s predictive query language (PQL) provides the flexibility to build a wide variety of growth and marketing models. PQL statements enable data scientists declare the entity, target, filters, and optimization goal of a predictive task, in a purely declarative manner. In practice, this enables teams to quickly build GNN-powered recommendations for dozens of predictive tasks across the customer acquisition funnel.
Here are a few of the solutions that Kumo supports.
Kumo does not require data to be transformed to fit a prescriptive schema, nor does it require the installation of a tracking pixel.
Instead, Kumo makes predictions directly from the raw data that already exists in the data warehouse. The Kumo graph builder makes it easy to stitch together data from many different sources. Just connect the tables to the graph and go.
For the best results on growth and marketing use cases, Kumo encourages using data such as:
Since Kumo can stitch together signal across multiple different data sources, Kumo can achieve significantly better predictive accuracy than traditional alternatives.
Kumo reads and writes data directly to the client’s data lakehouse, supporting cloud-first data science workflows. Supported lakehouses include Snowflake, Databricks, AWS S3, GCP BigQuery, and Azure Synapse (coming soon). For example, users have found success using Kumo as part of a DBT-based development environment, using Airflow for orchestration, and Streamlit for consumption.
Additionally, Kumo provides data warehouse native deployment options, which keeps your data secure by performing data processing within your Snowflake or Databricks account. This makes Kumo suitable for use in highly regulated environments, including banking, healthcare, and government.
Kumo uses a distributed GNN training system, written in C++, which can handle multi-terabyte datasets with tens of billions of rows, and has customers that make daily recommendations for more than 100M active users or more than 10M inventory items.
Because GNNs are great at discovering complex patterns in sparse data, Kumo is also a good fit for small datasets containing 1000’s of users, and only 10’s of items.
In order to cover both in-product and out-of-product growth use cases, Kumo supports a variety of serving methods:
Kumo recommendations are powered by a GNN architecture, inspired by several academic papers in recent history. Data scientists can benefit from these advances in model architecture, without needing to code them up manually.
Here is some of the research that is used by Kumo AI:
Kumo also uses a powerful data encoding stack to convert multi-modal data into representations for deep learning.
The Kumo model planner empowers data scientists to quickly iterate and apply their domain knowledge to the model.
Specifically, the Kumo model planner gives control over:
For example, in the world of e-commerce, it is very common for user behavior to differ between the “December holiday season” and the rest of the year. If you want to make sure your churn prediction model can generalize through the year, tune the split parameter of the model planner, to pick appropriate time frames for train, validation, and test.
Data Scientists have achieved 73% performance lift on a “next best action” model, using the fine-grained controls offered in the model planner.
PQL is a declarative syntax for defining machine learning problems. It is highly flexible and easy to learn, supporting inline filters, boolean expressions, and aggregation functions.
Data scientists can quickly experiment with many different and complex predictive formulations of a machine learning problem in very few lines of code.
For example, the following “lead ranking” query predicts whether a each active lead will have a conversion in the next N days, assuming that a sales person reaches out to them tomorrow.
As part of the training process, Kumo automatically computes data visualizations and metrics to help understand the model’s strengths and weaknesses.
In order to support ongoing validation of model correctness, Kumo has the following features related to MLOps:
Because Kumo writes directly to the data lakehouse, it is easy to connect with other cloud software commonly used in growth, marketing, or sales. Here are just a few examples: