The ethical considerations are really something.
It's also nice that all the conclusions in the paper are wrong because they start with a mistaken premise that content warnings mean that a post is "inappropriate".
@jaranta The database appears to have been retracted already
@bgcarlisle That was fast. Good.
@jaranta Thanks for the actual paper link, let's see it. :P I'll look at the ethics once I see it, but that title and its "inappropriate" is hilarious in itself from a methods perspective - this is why you don't rely on automated data collection without actually having a living human enter the context and look around first.
@werekat It's as if you need to understand the topic your researching.
@jaranta Yes. I only read the abstract but this is painfully clear.
@jaranta just wow.
@jaranta They anonymized the data, right ... and those were public posts.
What am i missing?
@qcat It's not possible to anonymise textual data so that the original text is impossible to find afterwards, meaning that it is impossible to anonymise textual data. There are methodological work-arounds that are used in internet research, but these computer scientists probably did not know about them.
@jaranta Yeah, they said they anonymised the user data, but if nobody knows who posted what, then how relevant is the content? Also, I don't see them using any explicit text in their analysis. Of course if they keep that dataset lying around (or even worse, the non-anonymised version) and provide it to third parties, that would indeed be problematic.
@qcat They published the dataset. It's since been taken down.
@jaranta Ah, ok. And the implication being that anyone can find the original posts even if you don't know the user and imstance if you search through the instance list tjey used. That's the concern i guess?
Scholar Social is a microblogging platform for researchers, grad students, librarians, archivists, undergrads, academically inclined high schoolers, educators of all levels, journal editors, research assistants, professors, administrators—anyone involved in academia who is willing to engage with others respectfully.