Coris AI launches Merchant Real Industry using GPT-4

Vinodh Poyyapakkam

December 11, 2023

Coris AI is building the modern risk infrastructure for payment processors. We’re excited to announce Merchant Real Industry, the firstclassification model that uses GPT-4 to automatically determine a merchant’s MCC and NAICS codes with >90% accuracy.

‍

The problem: Manual merchant industry classification

Payment processors play a pivotal role in powering purchases from online and brick-and-mortar merchants, but many of these organizations still rely on manual processes for merchant underwriting.

A key part of the merchant underwriting process is determining a merchant’s Merchant Category Code (MCC). For certain purposes, determining the merchant’s North American Industry Classification System (NAICS) code also becomes important.

Accurate industry classification of merchants is important for payment processors to manage risk, comply with regulations, and provide efficient services. It helps payment processors pre-qualify potential merchants, identify prohibited or high-risk businesses, and monitor for fraudulent activities.

However, determining a merchant’s MCC and NAICS code classification can be challenging. There are various factors that go into determining these codes: the product/service sold, mode of sale (in-person vs. online), method of sale (one-time or recurring), whether there is a free trial, etc.

“Proper, at-scale industry classification of businesses has been a very difficult problem to solve for data scientists working in payments. Prevailing systems have not worked well for a variety of reasons including the diverse nature of businesses, hard-to-get sources of sufficiently accurate business information, and extremely imbalanced training sets.”

- Michael Maze, Former Head of Seller Risk Data Science at PayPal

With our new model, Coris AI is taking the guesswork out of the industry classification process and empowering risk practitioners to work more efficiently.

‍

Coris AI’s model

We built our own MCC and NAICS code classification models based on available small business data such as data from merchants’ websites, online reviews, and other third party online sources. Based on our testing, our model can predict a merchant’s MCC and NAICS codes in a few seconds.

‍

How we built this using GPT-4

Once OpenAI’s GPT-4 API became available last month, we started building our model on top of GPT-4. With our decades of experience building merchant risk tools, we were well aware that this would take more than using the latest Large Language Model (LLM) out there. At a high-level, our automated industry classification process is outlined below:

Collect basic merchant information from customers, and determine their online presence in various platforms and governmental data.
Apply proprietary entity resolution techniques to validate and verify the accuracy of the information.
Identify the merchant’s website (if available) and scrape relevant content for industry classification.
Leverage additional information from online platforms to gain insights into the merchant’s industry and business activities.
Run the collected data through our classification model to determine the MCC and NAICS codes.
Apply predefined thresholds to ensure reliable and accurate classification results.

‍

Based on our testing, we have seen the results from this model to be over 90% accurate in classifying a business into the right MCC code and NAICS code that a trained human being would approve of. While we are early, we recently demoed our model at an industry conference and received extremely positive responses from customers and industry experts alike.

While there is a lot of hype surrounding AI and LLMs such as GPT-4, we believe there is true potential in deploying this technology in specific areas of the merchant risk management process. In the case of industry classification, there is a host of contextual information about a merchant from various sources.

LLMs such as GPT and LLaMA have shown to excel at capturing semantic meaning through embeddings, which are numerical representations that convey the essence of a sentence’s meaning. These embeddings, along with the capabilities of LLMs, offer a natural solution to the problem at hand as they have proven their ability to grasp concepts and meaning, even when expressed in diverse ways using natural language. We don’t believe LLMs will be the right approach for every problem we are looking to solve for our customers, but they certainly proved the best for this problem.

‍

What’s next?

By accurately and automatically classifying merchants, payment processors and anyone who manages SMB risk can profoundly improve their risk management practices. Customers can access these attributes through our developer-first APIs or with a file upload in our portal. In the coming months, our team at Coris is excited to roll out several new products and features that will further improve our customers’ risk management processes.

If this is of interest to you or have any feedback, please contact us through our website.

Related Resources

How Autura Scaled Embedded Payments with Confidence Using Coris’ AI-Powered Risk Platform

The Complete Guide to Modern Risk Management for Payment Companies

Integrating Intercom: Automating Risk Communications

5 Capabilities Defining the Future of Risk Management

New Feature: Creditsafe Integration

The Future of Fraud Defense: AI Platforms That Protect Profits

Continuous Merchant Monitoring: Why Point-in-Time Reviews Aren’t Enough

Launching Exposure Calculator

Beyond Manual Reviews: AI Risk Management for Payment Processors

Introducing: Website Monitoring

7 AI-Powered Risk Management Solutions That Beat Fraud Every Time

Why Traditional Fraud Models Fail in the SMB Space

Launching Commercial Credit Data Integration

Introducing Bank Account Verification Without the Friction

5 Ways AI is Transforming Compliance for Fintech and Payment Platforms

Launching the Coris Intelligence Score

The True Cost of Manual Merchant Underwriting (And How to Automate It)

Launching the world’s first credit card transaction model for software platforms

Introducing social media data for creator risk monitoring

Foundation Finance automates merchant monitoring and prevents downstream fraud losses

Introducing CaseWatch

CorShield now predicts identity fraud for merchants in China

How to manage seller fraud on marketplaces

Introducing team assignments & custom SLAs

Introducing Reporting & Insights

Introducing our Zendesk Integration

Introducing the new Coris portal

Effectiv partners with Coris to automate SMB risk & fraud decisioning

Introducing our ACH fraud model

Kajabi consolidates payments risk monitoring activities with Coris

Introducing Risk AI for Automated Underwriting

Introducing our Stripe Radar integration

Introducing Risk AI, your agent for risk management workflows

Leading PayFac prevents six-figure fraud losses with Coris

What's next for AI in risk management?

Prohibited, restricted and high risk businesses: what they are, and how to automatically screen for them

Alloy partners with Coris to automate SMB risk & fraud intelligence

How does GPT-4-powered merchant industry classification compare to a risk analyst?

Introducing Account Graph

Introducing Adverse Media Insights

How do Stripe Connect Custom customers manage SMB risk?

Recap: Perspectives on Scaling Embedded Payments

Introducing our Adyen Integration

Introducing MerchantVision, powered by GPT-4 with vision

Announcing our fundraising, new SMB fraud model & KYB

New year, same risks?

Reducing fraud through MerchantProfiler

Recap: Risk management as a growth lever in a downturn

Introducing Account Tags

Our feature in the "Leaders in Payments" podcast

Recap: How to successfully manage embedded payments

Announcing our SOC 2 Type 2 Compliance

How to build a risk program for your payments product

Introducing IndustryMark

Introducing Fuzio, the first Merchant Risk Operating System

How to choose a payments model for your software platform

Introducing Case Management

Announcing our integration with Stripe Connect

Announcing our expansion to 42 additional countries

Announcing Coris’s SOC 2 Type I Compliance

Going global: Expanding to the U.K., Canada, and Australia

Automating Merchant Underwriting with Coris’s SiteRating

Coris AI launches Merchant Real Industry using GPT-4

Risk 101: Everything You Need to Know

The 5 Levels of Automating Small Business Underwriting

Why Monitoring Merchant Risk Needs to be Elevated

Strong Infrastructure is the Key to Scaling SMB Underwriting

Automation of Merchant Risk Monitoring is Essential

Coris AI launches Merchant Real Industry using GPT-4

Vinodh Poyyapakkam

April 18, 2023

‍

The problem: Manual merchant industry classification

Payment processors play a pivotal role in powering purchases from online and brick-and-mortar merchants, but many of these organizations still rely on manual processes for merchant underwriting.

- Michael Maze, Former Head of Seller Risk Data Science at PayPal

With our new model, Coris AI is taking the guesswork out of the industry classification process and empowering risk practitioners to work more efficiently.

‍

Coris AI’s model

‍

How we built this using GPT-4

Collect basic merchant information from customers, and determine their online presence in various platforms and governmental data.
Apply proprietary entity resolution techniques to validate and verify the accuracy of the information.
Identify the merchant’s website (if available) and scrape relevant content for industry classification.
Leverage additional information from online platforms to gain insights into the merchant’s industry and business activities.
Run the collected data through our classification model to determine the MCC and NAICS codes.
Apply predefined thresholds to ensure reliable and accurate classification results.

‍

What’s next?

If this is of interest to you or have any feedback, please contact us through our website.