PyXLL The Python Excel Add-In

NLP VLOOKUP using Scikit Learn

Build a VLOOKUP alternative using a natural language processing (NLP) technique from scikit-learn.

Excel’s VLOOKUP function is a well-known way of finding data in a table based on a key lookup, but an exact string lookup often isn’t convenient, and a ‘fuzzier’ match is a better way of joining two data sets together. For example, when comparing company names it is very common for the same name to appear slightly differently in different data sets (e.g. ‘Apple’ vs ‘Apple Inc’).

This video demonstrates a straightforward but effective string matching technique using Natural Language Processing. Using PyXLL, the resulting function can be used from Excel as an alternative to VLOOKUP.
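To give a feel for this kind of string matching, here is a minimal sketch. It assumes one common approach, TF-IDF over character n-grams compared by cosine similarity; the code in the video and on GitHub may differ. The function name `best_match`, its `threshold` parameter, and the sample company names are hypothetical and only for illustration.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity


def best_match(query, candidates, threshold=0.0):
    """Return (candidate, score) for the candidate most similar to query.

    Hypothetical helper: returns (None, score) when the best score does
    not exceed the threshold, i.e. when nothing similar enough was found.
    """
    # Character n-grams (built inside word boundaries) tolerate small
    # spelling and suffix differences better than whole-word tokens.
    vectorizer = TfidfVectorizer(analyzer="char_wb", ngram_range=(2, 3))
    candidate_vectors = vectorizer.fit_transform(candidates)
    query_vector = vectorizer.transform([query])

    # One similarity score per candidate, each between 0 and 1.
    scores = cosine_similarity(query_vector, candidate_vectors)[0]
    best_index = int(scores.argmax())
    best_score = float(scores[best_index])

    if best_score <= threshold:
        return None, best_score
    return candidates[best_index], best_score


# Illustrative data only.
names = ["Apple Inc", "Microsoft Corporation", "Alphabet Inc"]
match, score = best_match("Apple", names)
print(match, round(score, 2))
```

Raising `threshold` makes the lookup stricter, which is useful when a wrong match is worse than no match.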

Improving Excel's VLOOKUP with Natural Language Processing

During this video you will learn about the following topics:

  1. How to match similar but not identical strings using scikit-learn.

  2. How to write a Python function that can be called from Excel using PyXLL.

  3. How to use type hints to specify argument and return types.
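The three topics can be combined in a short sketch. This is not the code from the video or the GitHub repository. It is a simplified illustration that assumes PyXLL's `xl_func` decorator picks up the Python type hints for the argument and return types. The function name `name_similarity` and the TF-IDF settings are hypothetical, and the `try`/`except` fallback exists only so the sketch can be run outside Excel.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

try:
    from pyxll import xl_func
except ImportError:
    # Assumption for illustration: a no-op stand-in so the sketch can be
    # run and tested where the pyxll module is not installed.
    def xl_func(func):
        return func


@xl_func
def name_similarity(a: str, b: str) -> float:
    """Return a similarity score between 0 and 1 for two strings."""
    # Blank cells would leave the vectorizer with nothing to learn from.
    if not a.strip() or not b.strip():
        return 0.0

    # Compare the strings by their character n-grams rather than exactly.
    vectorizer = TfidfVectorizer(analyzer="char_wb", ngram_range=(2, 3))
    vectors = vectorizer.fit_transform([a, b])
    return float(cosine_similarity(vectors[0], vectors[1])[0, 0])
```

With the module loaded by PyXLL, the function could be called from a worksheet as `=name_similarity(A2, B2)`. Scoring one pair at a time keeps the type hints to plain `str` and `float`; a full VLOOKUP replacement would also take the lookup table as a range argument.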

The code is available on GitHub here: https://github.com/pyxll/pyxll-examples/tree/master/scikit-learn/nlp_vlookup

Inspired by the article by Mauro Di Pietro: https://towardsdatascience.com/surpass-excel-vlookup-with-python-and-nlp-ab20d56c4a1a

© Copyright PyXLL Ltd