February, 2015

article thumbnail

Introducing spaCy

Explosion

spaCy is a new library for text processing in Python and Cython. I wrote it because I think small companies are terrible at natural language processing (NLP). Or rather: small companies are using terrible NLP technology. This post shows the original launch announcement for spaCy , which came with some usage examples and benchmarks. The benchmarks are quite out of date, but I’m pleased to say usage has changed relatively little.

article thumbnail

Most HR Data Is Bad Data

Hacker News

'We are sorry. This site is optimized for Internet Explorer 9 and above. Please update your browser. We are sorry. This site is optimized for Safari 7 and above. Please update your browser. 4 / 5 Free Articles left Remaining. Register for more. |. Subscribe + Save! Register. Sign In. Sign In. Your cart. Your Shopping Cart is empty. Visit our store. Go to Cart.

Analytics 140
professionals

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

Introducing spaCy

Explosion

Computers don't understand text. This is unfortunate, because that's what the web almost entirely consists of. We want to recommend people text based on other text they liked. We want to shorten text to display it on a mobile screen. We want to aggregate it, link it, filter it, categorise it, generate it and correct it. spaCy provides a library of utility functions that help programmers build such products.

40
article thumbnail

How spaCy Works

Explosion

This post was pushed out in a hurry, immediately after spaCy was released. It explains some of how spaCy is designed and implemented, and provides some quick notes explaining which algorithms were used. The post pre-dates spaCy's named entity recogniser, but it provides some detail about the tokenisation algorithm, general design, and efficiency concerns.

article thumbnail

Optimizing The Modern Developer Experience with Coder

Many software teams have migrated their testing and production workloads to the cloud, yet development environments often remain tied to outdated local setups, limiting efficiency and growth. This is where Coder comes in. In our 101 Coder webinar, you’ll explore how cloud-based development environments can unlock new levels of productivity. Discover how to transition from local setups to a secure, cloud-powered ecosystem with ease.

article thumbnail

How spaCy Works

Explosion

The following are some hasty preliminary notes on how spaCy works. The short story is, there are no new killer algorithms. The way that the tokenizer works is novel and a bit neat, and the parser has a new feature set, but otherwise the key algorithms are well known in the recent literature. This post was pushed out in a hurry, immediately after spaCy was released.