Bookmark
How to calculate the alignment between BERT and spaCy tokens effectively and robustly
https://gist.github.com/tamuhey/af6cbb44a703423556c32798e1e1b704, posted 2021 by peter in development free language nlp opensource software toread
Suppose we want to combine a BERT-based named entity recognition (NER) model with a rule-based NER model built on top of spaCy. Although BERT's NER exhibits extremely high performance, it is usually combined with rule-based approaches for practical purposes. In such cases, what often bothers us is that tokens of spaCy and BERT are different, even if the input sentences are the same. For example, let's say the input sentence is "John Johanson 's house"; BERT tokenizes this sentence like
["john", "johan", "##son", "'", "s", "house"]
and spaCy tokenizes it like["John", "Johanson", "'s", "house"]
. To combine the outputs, we need to calculate the correspondence between the two different token sequences. This correspondence is the "alignment".
Bookmark
Covid: Misleading stat claims more vaccinated people die
https://www.bbc.com/news/health-57610998, posted 2021 by peter in conspiracy covid19 health science toread
One conspiracy site even claimed vaccinated people were dying at higher rates than those who had not received the jab, which is untrue.
This site and others use real figures in a misleading way, to arrive at a completely false conclusion - that the vaccine may not be working or even doing more harm than good.
Bookmark
How to Handle Privacy Access Requests Under the GDPR - Privacy Policies
https://www.privacypolicies.com/blog/gdpr-privacy-access-requests/, posted 2021 by peter in business eu howto law reference toread
Are you prepared to receive and process privacy access requests in compliance with the GDPR? The following guide will help you understand your role in promoting access to data and how to create a system that saves you time and prevents damage to your reputation.
Bookmark
Matthew Walker's "Why We Sleep" Is Riddled with Scientific and Factual Errors - Alexey Guzey
https://guzey.com/books/why-we-sleep/, posted 2021 by peter in health science toread
Any book of Why We Sleep's length is bound to contain some factual errors. Therefore, to avoid potential concerns about cherry-picking the few inaccuracies scattered throughout, in this essay, I'm going to highlight the five most egregious scientific and factual errors Walker makes in Chapter 1 of the book. This chapter contains 10 pages and constitutes less than 4% of the book by the total word count.
Bookmark
Making Sense of The Audio Stack On Unix
https://venam.nixers.net/blog/unix/2021/02/07/audio-stack.html, posted 2021 by peter in audio hardware linux reference software toread
Audio on Unix is a little zoo, there are so many acronyms for projects and APIs that it's easy to get lost. Let's tackle that issue! Most articles are confusing because they either use audio technical jargon, or because they barely scratch the surface and leave people clueless. A little knowledge can be dangerous.
In this article I'll try to bridge the gap by not requiring any prerequisite knowledge while also giving a good overview of the whole Unix audio landscape. There's going to be enough details to remove mysticism (Oh so pernicious in web bubbles) and see how the pieces fit.
Bookmark
The Cryptopals Crypto Challenges
https://cryptopals.com/, posted 2021 by peter in development learning security toread
This is a different way to learn about crypto than taking a class or reading a book. We give you problems to solve. They're derived from weaknesses in real-world systems and modern cryptographic constructions. We give you enough info to learn about the underlying crypto concepts yourself. When you're finished, you'll not only have learned a good deal about how cryptosystems are built, but you'll also understand how they're attacked.
Bookmark
Magenta
https://magenta.tensorflow.org/, posted 2021 by peter in ai free music opensource software toread
An open source research project exploring the role of machine learning as a tool in the creative process.
Bookmark
Nim by Example
https://nim-by-example.github.io/, posted 2020 by peter in development howto nimlang reference toread
Nim is a powerful statically typed language that allows the programmer expressiveness without compromising run-time performance. As a general purpose programming language, it gives the same sort of power and performance as C++, but in a nicer package and with even more powerful tools!
Bookmark
Tips to stay focused and finish your hobby project
https://stackoverflow.blog/2020/12/03/tips-to-stay-focused-and-finish-your-hobby-project/, posted 2020 by peter in howto inspiration management opinion toread
As I said, when I got stuck, the problem was never with coding or other technical issues. Of course, understanding every little detail of Flask was difficult sometimes—I was also hunting bugs for hours, sure. But the things that stopped me were mostly mindset related.
So here are a few practical pieces of advice to get over these issues—for my future self and for you—if you want to get a hobby project done!
Bookmark
The (Literally) Unbelievable Story of the Original Fake News Network
https://narratively.com/the-literally-unbelievable-story-of-the-original-fake-news-network/, posted 2020 by peter in news politics propaganda toread war
And that radio station everyone was reacting to? It wasn’t even in Guatemala. The disc jockeys aired their “reports” from a shack in Nicaragua. Many of their broadcasts had actually been prerecorded earlier in the year.
In Florida.
In an office belonging to the Central Intelligence Agency.
The radio station that had all of Guatemala in such a frenzy was part of a secret CIA “terror program based on Orson Welles,” declassified documents now show. It was overseen by an American actor and spy novelist whose salary was paid by U.S. tax dollars. The whole operation was, to use today’s parlance, “fake news.”
|< First < Previous 31–40 (473) Next > Last >|