Bookmark
A list of free data matching and record linkage software
https://github.com/J535D165/data-matching-software, posted Aug '21 by peter in development free list nlp opensource software
This is a list of (Fuzzy) Data Matching software. The software in this list is open source and/or freely available.
The term data matching is used to indicate the procedure of bringing together information from two or more records that are believed to belong to the same entity. Data matching has two applications: (1) to match data across multiple datasets (linkage) and (2) to match data within a dataset (deduplication). See the Wikipedia page about data matching for more information.
Similar terms: record linkage, data matching, deduplication, fuzzy matching, entity resolution
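To make the two applications concrete, here is a minimal sketch using only Python's standard library. It is not taken from any tool on the linked list; the helper names (`similarity`, `link`, `deduplicate`) and the 0.85 threshold are assumptions chosen for illustration, and real record linkage software typically adds blocking, field-wise comparison, and better-calibrated scoring.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Return a 0..1 similarity score after light normalisation."""
    return SequenceMatcher(None, a.lower().strip(), b.lower().strip()).ratio()


def link(left, right, threshold=0.85):
    """Linkage: pair each record in `left` with its best match in `right`."""
    pairs = []
    for rec in left:
        best = max(right, key=lambda cand: similarity(rec, cand), default=None)
        # Keep the pair only if the best candidate is similar enough.
        if best is not None and similarity(rec, best) >= threshold:
            pairs.append((rec, best))
    return pairs


def deduplicate(records, threshold=0.85):
    """Deduplication: keep the first record of each group of near-duplicates."""
    kept = []
    for rec in records:
        if not any(similarity(rec, k) >= threshold for k in kept):
            kept.append(rec)
    return kept


linked = link(["Jon Smith", "Alice Wong"], ["John Smith", "Bob Lee"])
unique = deduplicate(["John Smith", "john smith ", "Jon Smith", "Alice Wong"])
```

The threshold trades false matches against missed matches, so it would need tuning on real data.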
Bookmark
How to calculate the alignment between BERT and spaCy tokens effectively and robustly
https://gist.github.com/tamuhey/af6cbb44a703423556c32798e1e1b704, posted Jul '21 by peter in development free language nlp opensource software toread
Suppose we want to combine a BERT-based named entity recognition (NER) model with a rule-based NER model built on top of spaCy. Although BERT's NER exhibits extremely high performance, it is usually combined with rule-based approaches for practical purposes. In such cases, what often bothers us is that tokens of spaCy and BERT are different, even if the input sentences are the same. For example, let's say the input sentence is "John Johanson 's house"; BERT tokenizes this sentence like
["john", "johan", "##son", "'", "s", "house"]
and spaCy tokenizes it like
["John", "Johanson", "'s", "house"]
To combine the outputs, we need to calculate the correspondence between the two different token sequences. This correspondence is the "alignment".
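A minimal sketch of one way to compute such an alignment, using character offsets. This is a simpler technique than the one described in the linked gist: it assumes both token sequences spell out exactly the same text once lowercased and stripped of the WordPiece `##` prefix, and it raises an error otherwise. The function names (`normalize`, `char_spans`, `align`) are hypothetical.

```python
def normalize(token: str) -> str:
    """Lowercase a token and drop the WordPiece continuation prefix."""
    if token.startswith("##"):
        token = token[2:]
    return token.lower()


def char_spans(tokens):
    """Return (start, end) character offsets of each token in the joined text."""
    spans, pos = [], 0
    for tok in tokens:
        length = len(normalize(tok))
        spans.append((pos, pos + length))
        pos += length
    return spans


def align(a, b):
    """For each token in `a`, list the indices of the overlapping tokens in `b`."""
    if "".join(map(normalize, a)) != "".join(map(normalize, b)):
        # Assumption of this sketch: no robust handling of mismatched text.
        raise ValueError("token sequences do not spell the same normalised text")
    spans_b = char_spans(b)
    return [
        [j for j, (bs, be) in enumerate(spans_b) if bs < end and start < be]
        for start, end in char_spans(a)
    ]


bert_tokens = ["john", "johan", "##son", "'", "s", "house"]
spacy_tokens = ["John", "Johanson", "'s", "house"]
spacy_to_bert = align(spacy_tokens, bert_tokens)
bert_to_spacy = align(bert_tokens, spacy_tokens)
```

Here `spacy_to_bert` maps "Johanson" to the two BERT pieces "johan" and "##son". Handling inputs where normalisation leaves the two texts different (accents, unknown tokens) needs a diff-based approach, which this sketch deliberately leaves out.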
Bookmark
Configuration as Code (Jenkins plugin)
https://plugins.jenkins.io/configuration-as-code/, posted Jun '21 by peter in automation continuousdelivery development free software versioncontrol
The Configuration as Code plugin is an opinionated way to configure Jenkins based on human-readable declarative configuration files. Writing such a file should be feasible without being a Jenkins expert, just translating into code a configuration process one is used to executing in the web UI.
Bookmark
lucidrains/deep-daze: Simple command line tool for text to image generation
https://github.com/lucidrains/deep-daze, posted Mar '21 by peter in ai free graphics opensource python software
Bookmark
Jamulus — Play music online. With friends. For free.
https://jamulus.io/, posted Mar '21 by peter in collaboration free music online
Jamulus is software for playing music, rehearsing, or just jamming with anyone online with low latency. You can use your Windows, macOS or Linux machine to connect to Jamulus servers worldwide. Jamulus is free and you can just use your normal broadband connection. Simply connect to a public server or host your own private one. Jamulus has been in development since 2006 and is designed for high quality, low-latency sound, making it easy to play together remotely and in time.
TinyLetter is a personal newsletter service brought to you by the people behind Mailchimp. People use it to send updates, digests, and dispatches to their fans and friends.
Though they're built on the same infrastructure, TinyLetter is for people who don't need all the business features that come along with Mailchimp. Simplicity is at the heart of everything we do at TinyLetter.
TinyLetter is a completely free service.
Bookmark
whip: Simple fast http server for nim based on httpbeast and nest for high performance routing
https://github.com/mattaylor/whip, posted 2021 by peter in development free nimlang opensource software
WHIP is a high performance web application server based on the excellent httpbeast and routing provided by nest with some additional optimizations.
WHIP is still in development and is not recommended for production use. Much is still missing or untested, but for basic API use cases the performance numbers look pretty good so far.
Bookmark
LibreTranslate: Free and Open Source Machine Translation API
https://github.com/uav4geo/LibreTranslate, posted 2021 by peter in api free language nlp opensource software
Free and Open Source Machine Translation API, entirely self-hosted. Unlike other APIs, it doesn't rely on proprietary providers such as Google or Azure to perform translations.
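As a rough illustration of what calling a self-hosted instance might look like, the sketch below only builds the HTTP request and does not send it. The `/translate` path and the `q`, `source`, and `target` fields follow the project's documented API as far as I know, but treat them as assumptions and check the README of the version you run; the base URL and the helper name `build_translate_request` are made up for the example.

```python
import json
from urllib.request import Request


def build_translate_request(base_url: str, text: str, source: str, target: str) -> Request:
    """Build (but do not send) a JSON POST request for a translation."""
    # Assumed field names: q (text), source and target (language codes).
    payload = {"q": text, "source": source, "target": target}
    return Request(
        url=base_url.rstrip("/") + "/translate",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Hypothetical local instance; adjust host and port to your own deployment.
req = build_translate_request("http://localhost:5000/", "Hello", "en", "es")
```

Passing `req` to `urllib.request.urlopen` against a running server would perform the call; the response is expected to be JSON containing the translated text.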
Bookmark
Magenta
https://magenta.tensorflow.org/, posted 2021 by peter in ai free music opensource software toread
An open source research project exploring the role of machine learning as a tool in the creative process.
Bookmark
axe-core: Accessibility engine for automated Web UI testing
https://github.com/dequelabs/axe-core, posted 2021 by peter in automation development free opensource software testing webdesign
Axe is an accessibility testing engine for websites and other HTML-based user interfaces. It's fast, secure, lightweight, and was built to seamlessly integrate with any existing test environment so you can automate accessibility testing alongside your regular functional testing.