It's also nice that all the conclusions in the paper are wrong because they start with a mistaken premise that content warnings mean that a post is "inappropriate".

Scraping 

Both Mastodon instances I use (scholar.social and wandering.shop) were included in the University of Milan scrape.

See sunbeam.city/@puffinus_puffinu for the list.

⚠️ The Fediverse has been scraped, again ⚠️

Almost six million posts from 363 instances have been scraped.

"All the posts with public visibility published by users hosted on Mastodon servers [...] which support the English language" have been scraped along with their metadata, and the "policy, the code of conduct and the prohibited contents of each instance".

The dataset is an attempt at creating an open dataset for "research" into algorithms like the ones Facebook uses to identify problematic content, based around users' use of Content Warnings.

The dataset can be found here:
dataverse.harvard.edu/dataset.

It was created by the University of Milan, Italy, apparently for the 13th AAAI:
aaai.org/

The associated publication:
aaai.org/ojs/index.php/ICWSM/a or likeable.space/media/30ae595a1 or DM me for a copy.

Related dataset:
dataverse.mpi-sws.org/dataset.

Original post:
likeable.space/objects/98fe744 @tastytea

#FediAdmin #MastoAdmin #MastoDev #Privacy #OpSec #Warning #Fediverse #Mastodon #Scraping

I'm testing out @pinafore and I like it so far. I just wish the option to switch instances could be added to the tabs up top.

Guess what, folks. I just reread a paper (I skimmed it in 2017, according to my notes) that I needed for the paper I submitted recently. Lol. Ugh. Reading never ends.

I had a very bizarre small world moment earlier today when I was looking through the faculty list of a department I'm applying to. One of the lecturers at this small liberal arts college in North Africa has the name (and clearly past address by virtue of alma mater) of a person whose voucher and catalog subscriptions used to arrive at my friend's apartment.

Quick poll: is anyone here running an instance of or to host personal content (e.g. lectures, public teaching etc)?

I decided last week to translate one speculative poem a week this year. The plan is to read at least one daily poem in the genre so that each weekend I can choose one to translate into Arabic. To make it easy on me I'll be doing a daily poem thread on here all year.

tfw you need to get an official version of a draft chapter you've been reading to cite it somewhere and the nearest library holding it is in Malta.

the luddites did not destroy machines in opposition to machinery itself but to those owners that used the machinery to impoverish and immiserate the people ✊️

150 pages into Tocqueville and this is what I got so far

Tocqueville << America rules because Puritans >>
Everyone else << what? >>
Tocqueville << I said what I said >>

Teaching is great because sometimes it makes you re-think the overly complicated way you did something in a previous project and come up with something simpler and equally good, just for the sake of being better able to teach it

What do y'all do when you're a PhD graduate with no employment prospects and you get an email from a conference organizer trying to check the affiliation?
