progress: Linux tool to show progress for cp, mv, dd, ...
https://github.com/Xfennec/progress, posted 19 May by peter in free linux opensource shell software
This tool can be described as a Tiny, Dirty C command that looks for coreutils basic commands (cp, mv, dd, tar, gzip/gunzip, cat, etc.) currently running on your system and displays the percentage of copied data. It can also show estimated time and throughput, and provides a "top-like" mode (monitoring).
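As a rough illustration of how a "percentage of copied data" could be derived on Linux: the kernel exposes the current offset of every open file descriptor in /proc/<pid>/fdinfo/<fd> as a "pos:" field, and dividing that offset by the file size gives a completion figure. The sketch below only parses such text; the helper names parse_pos and percent_done are hypothetical, and this is not necessarily how progress itself is implemented.

```python
def parse_pos(fdinfo_text):
    """Return the byte offset from the text of a /proc/<pid>/fdinfo/<fd> file."""
    for line in fdinfo_text.splitlines():
        key, _, value = line.partition(":")
        if key.strip() == "pos":
            return int(value.strip())
    raise ValueError("no 'pos' field found")


def percent_done(fdinfo_text, file_size):
    """Percentage of file_size already passed by the descriptor's offset."""
    if file_size <= 0:
        return 0.0  # size unknown or empty file: nothing meaningful to report
    return min(100.0, 100.0 * parse_pos(fdinfo_text) / file_size)


# Sample text in the layout the kernel uses for fdinfo entries (values invented).
sample = "pos:\t512\nflags:\t0100000\nmnt_id:\t29\n"
print(percent_done(sample, 2048))
```

On a real system the text would come from reading /proc/<pid>/fdinfo/<fd>, and the size from os.stat on the file that descriptor points to.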
git-remote-gcrypt: PGP-encrypted git remotes
https://github.com/spwhitton/git-remote-gcrypt, posted 28 Mar by peter in development free opensource software versioncontrol
git-remote-gcrypt is a git remote helper to push and pull from repositories encrypted with GnuPG, using a custom format. This remote helper handles URIs prefixed with
https://nim-lang.org/docs/testament.html, posted 24 Feb by peter in development free nimlang software testing
https://www.zaproxy.org/, posted Jul '22 by peter in development free opensource security software toread
The world’s most widely used web app scanner. Free and open source. Actively maintained by a dedicated international team of volunteers.
Melrose, a language to program and play music
ernestmicklei.com/melrose/introduction_melrose/, posted Mar '22 by peter in free music software toread
Melrose is both a language and a tool to create and listen to music interactively. The language uses musical primitives (note, sequence, chord) and many functions (map, group, transpose) that can be used to create more complex patterns, loops and tracks. Melrose uses MIDI output to produce sound on any attached device (hardware or software). Melrose can also react to MIDI input to start, record and stop playing musical objects. A plugin is available for Microsoft Visual Studio for the best usage experience. For a quickstart, without any installation, you can use the Melrose playground.
https://woob.tech/applications/smtp, posted Jan '22 by peter in communication email free opensource software toread
Daemon application able to fetch messages from supported websites and send them by mail. It can also be used to send a reply to a message (on a module which supports this feature), by piping an email to it.
Looks like it should be possible to read (perhaps even write) private messages on Reddit without having to deal with Reddit's own, pretty terrible, web interface. Worth looking into.
Typesense | Fast, typo-tolerant open source search engine
https://typesense.org/, posted Aug '21 by peter in development free opensource search software
Typesense is an open source, typo-tolerant search engine that is optimized for instant sub-50ms searches, while providing an intuitive developer experience.
A list of free data matching and record linkage software
https://github.com/J535D165/data-matching-software, posted Aug '21 by peter in development free list nlp opensource software
This is a list of (Fuzzy) Data Matching software. The software in this list is open source and/or freely available.
The term data matching is used to indicate the procedure of bringing together information from two or more records that are believed to belong to the same entity. Data matching has two applications: (1) to match data across multiple datasets (linkage) and (2) to match data within a dataset (deduplication). See the Wikipedia page about data matching for more information.
Similar terms: record linkage, data matching, deduplication, fuzzy matching, entity resolution
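To make the two applications concrete, here is a minimal sketch using only Python's standard library. It scores string similarity with difflib.SequenceMatcher, which is just one of many possible measures and is not taken from the list above. The function names and the 0.85 threshold are assumptions chosen for illustration.

```python
from difflib import SequenceMatcher
from itertools import combinations, product


def similarity(a, b):
    """Similarity in [0, 1] between two strings, ignoring case and outer spaces."""
    return SequenceMatcher(None, a.strip().lower(), b.strip().lower()).ratio()


def find_duplicates(records, threshold=0.85):
    """Deduplication: pairs within one dataset that look like the same entity."""
    return [(a, b) for a, b in combinations(records, 2)
            if similarity(a, b) >= threshold]


def link_records(left, right, threshold=0.85):
    """Linkage: pairs across two datasets that look like the same entity."""
    return [(a, b) for a, b in product(left, right)
            if similarity(a, b) >= threshold]


people = ["Jon Smith", "John Smith", "Alice Jones"]
print(find_duplicates(people))
print(link_records(["ACME Corp."], ["Acme Corp", "Zenith Ltd"]))
```

Comparing every pair is quadratic in the number of records, so this only suits small datasets; dedicated tools usually reduce the number of candidate pairs first.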
How to calculate the alignment between BERT and spaCy tokens effectively and robustly
https://gist.github.com/tamuhey/af6cbb44a703423556c32798e1e1b704, posted Jul '21 by peter in development free language nlp opensource software toread
Suppose we want to combine a BERT-based named entity recognition (NER) model with a rule-based NER model built on top of spaCy. Although BERT's NER exhibits extremely high performance, it is usually combined with rule-based approaches for practical purposes. In such cases, what often bothers us is that tokens of spaCy and BERT are different, even if the input sentences are the same. For example, let's say the input sentence is "John Johanson 's house"; BERT tokenizes this sentence like
["john", "johan", "##son", "'", "s", "house"]
and spaCy tokenizes it like
["John", "Johanson", "'s", "house"].
To combine the outputs, we need to calculate the correspondence between the two different token sequences. This correspondence is the "alignment".
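A simplified way to compute such an alignment is to map every character to the token that owns it and compare the two sequences character by character. The sketch below assumes that both tokenizations spell exactly the same text once case is ignored and the WordPiece "##" prefix is removed; it is not the algorithm from the linked gist, and the function names are hypothetical.

```python
def normalize(token):
    """Lowercase a token and drop the WordPiece continuation prefix, if any."""
    return token[2:].lower() if token.startswith("##") else token.lower()


def char_owners(tokens):
    """For each character of the normalized text, the index of its token."""
    owners = []
    for index, token in enumerate(tokens):
        owners.extend([index] * len(normalize(token)))
    return owners


def align(a, b):
    """For each token in a, the indices of the tokens in b that overlap it."""
    if "".join(map(normalize, a)) != "".join(map(normalize, b)):
        raise ValueError("token sequences do not spell the same text")
    a2b = [[] for _ in a]
    for ia, ib in zip(char_owners(a), char_owners(b)):
        if not a2b[ia] or a2b[ia][-1] != ib:
            a2b[ia].append(ib)
    return a2b


bert = ["john", "johan", "##son", "'", "s", "house"]
spacy = ["John", "Johanson", "'s", "house"]
print(align(spacy, bert))  # [[0], [1, 2], [3, 4], [5]]
print(align(bert, spacy))  # [[0], [1], [1], [2], [2], [3]]
```

When the tokenizers change the text in other ways (accent stripping, unknown-token placeholders), the equality check fails and a more robust, diff-based alignment is needed.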
Configuration as Code (Jenkins plugin)
https://plugins.jenkins.io/configuration-as-code/, posted 2021 by peter in automation continuousdelivery development free software versioncontrol
The Configuration as Code plugin is an opinionated way to configure Jenkins based on human-readable declarative configuration files. Writing such a file should be feasible without being a Jenkins expert, just translating into code a configuration process one is used to executing in the web UI.