hey Fediverse, a question to those building and maintaining the software and infra that runs all this:
do you know of any public datasets of the fediverse?
I'd like to use one in a data science project for the course I'm currently taking.
BREAKING: Mega-corp #Meta wants to silence #whistleblower because the truth about #Facebook was just *too* captivating.
They're probably worried her book will reveal that Facebook's secret sauce is just cat memes and data mining.
Who knew!?
https://www.engadget.com/social-media/meta-is-trying-to-stop-a-former-employee-from-promoting-her-book-about-facebook-004938899.html #Secrets #CatMemes #DataMining #HackerNews #ngated
Posted this on Facebook 10 days ago, after an extended hiatus there and just before logging off for another extended period. I think it is relevant to everyone moving away from corporate social media as well. Read the mirrored post at:
https://diasporasocial.net/posts/545d82b0e018013d71e20a04ffb1b246
or
https://my-place.social/display/e599373b-1867-cf4d-0595-6a8345192377
I now know the power consumption (#Stromverbrauch) of the grow light and the air fryer.
The range #Test in the basement was successful. New loads to analyze:
the Schuko charging brick (#Ladeziegel) for the car (#Auto), the #Solar pump, and the hot-water pump (#Warmwasser #Pumpe).
#pv #speicher #balkonsolar #datamining
Our colleague Hidir Arras from patent4science research is co-organizing the 6th PatentSemTech Workshop at #SIGIR2025 in the beautiful city of Padua, Italy! Call for Papers is open 'til April 23: http://ifs.tuwien.ac.at/patentsemtech/
Submit your cutting-edge research, case studies, and demos exploring #AI, #NLP, and #TextMining innovations applied to #IP and related domains.
[Data Workshop] The INA lab is organizing a workshop @iscpif on March 12 at 5:30 pm devoted to exploring (#statistique, #TAL…) transcripts of TF1 and FR2 television news broadcasts.
A few places are still available: https://framaforms.org/atelier-donnees-ina-1739180738
Some independence with quantitative analysis tools (Python or R, CSV, etc.) is required to get the full benefit of the workshop.
shevabam/get-rss-feed-url-extension: Retrieve RSS feeds URLs from WebSite - Chrome Extension
A simple yet effective tool for the RSS data mining toolbox. The browser extension simplifies the process of finding RSS feed URLs by displaying them directly. No more digging through a website's source code for RSS or Atom meta tags.
The extension does not collect or store any user information.
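For anyone who wants the same lookup in a script, here is a minimal sketch of feed discovery. It assumes the page advertises its feeds through the conventional `<link rel="alternate">` tags in the HTML head; it is not taken from the extension's code, and `FeedLinkParser` and `find_feed_urls` are hypothetical names chosen for illustration.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin

# MIME types conventionally used to advertise RSS and Atom feeds.
FEED_TYPES = {"application/rss+xml", "application/atom+xml"}


class FeedLinkParser(HTMLParser):
    """Collect href values of <link rel="alternate"> tags that advertise a feed."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.feeds = []

    def handle_starttag(self, tag, attrs):
        if tag != "link":
            return
        # Attribute values can be None (e.g. <link rel>), so normalise to "".
        attr = {name: (value or "") for name, value in attrs}
        rel_tokens = attr.get("rel", "").lower().split()
        is_feed = attr.get("type", "").lower() in FEED_TYPES
        if "alternate" in rel_tokens and is_feed and attr.get("href"):
            # Resolve relative hrefs against the page URL.
            self.feeds.append(urljoin(self.base_url, attr["href"]))


def find_feed_urls(html, base_url):
    """Return the feed URLs advertised in an HTML document, in document order."""
    parser = FeedLinkParser(base_url)
    parser.feed(html)
    return parser.feeds
```

The HTML can come from any source (a saved file, an HTTP response body); sites that only link their feed from the page body, without a `<link>` tag, will not be found by this approach.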
That is so very true!!
A clear statement.
#DataCenters are consuming large amounts of #energy & #water. Data centers are used for #AI, #socialmedia, #cryptocurrency, #cloud storage, #dataMining, and more, and they are threatening our #environment. Reduce your use of AI, #bitcoin, and cloud storage. Store large files locally. Will a local data center consume a large amount of your water supply? "AI data centers are guzzling energy and water at alarming rates." #Google #Apple #Amazon #Microsoft #Meta
https://www.yahoo.com/news/report-uncovers-disturbing-secret-tech-111528733.html
I was looking for a parseable Wiktionary dump and discovered Kaikki.org, a digital archive and data mining group. They offer a massive, parseable dataset in JSONL format.
https://kaikki.org/dictionary/rawdata.html
#opendata #opensource #wiktionary #dataset #datamining #ai #ml
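Since JSONL stores one JSON object per line, a dump like this can be streamed without loading the whole file into memory. Below is a rough sketch of that pattern; the field name `pos` and the file name in the usage note are assumptions for illustration, so check the actual keys in the download before relying on them.

```python
import json
from collections import Counter


def iter_jsonl(lines):
    """Yield one parsed JSON object per non-empty line."""
    for line in lines:
        line = line.strip()
        if line:
            yield json.loads(line)


def count_by_field(lines, field):
    """Count entries by the value of `field`; entries lacking it go under '<missing>'."""
    return Counter(entry.get(field, "<missing>") for entry in iter_jsonl(lines))
```

Usage would look something like `with open("dump.jsonl", encoding="utf-8") as f: print(count_by_field(f, "pos").most_common(10))`, where `dump.jsonl` is a placeholder for whichever file you downloaded. Because the file object is consumed line by line, this works on multi-gigabyte dumps as well.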
New tools for text mining!
The Istex infrastructure is launching 2 powerful new services for document analysis:
Teeft: fast extraction of key terms
TermSuite: advanced terminology extraction
#TextMining #DataMining #ScienceOuverte
https://www.inist.fr/nos-actualites/nouveaux-outils-pour-la-fouille-de-textes-decouvrez-teeft-et-termsuite/
Sooo nice that you offer your members an ethical alternative, @PartijvoordeDieren!
#Vermont-based Cluster members Donna Rizzo and Byung Lee collaborated on this article for #DataMining and Knowledge Discovery.
"We use data from an existing multi-site epidemiological study of serious illness conversations as one example of how efficient computational methods can add to the science of healthcare communication."
Call for Papers for IDEAS 2025.
Location: School of Computing at Newcastle University
When: 14ᵗʰ - 16ᵗʰ of July, 2025
Submission Deadline: 15ᵗʰ of May, 2025
Notification of Acceptance: 13ᵗʰ of June, 2025
The annual IDEAS conference is a top international forum for data engineering researchers, practitioners, developers, and application users to explore revolutionary ideas and results and exchange techniques, tools, and experiences. We invite the participation of all interested in this event, which provides insight into original research contributions relating to all aspects of #databaseengineering defined broadly, and particularly topics of emerging interest describing work on integrating new #technologies into #products and #applications, on experiences with existing and novel techniques, and on the identification of unsolved #challenges.
So far, we are honoured to announce an invited talk by Jim Webber from Neo4j and an #EDI session organized by Laura Heels, so do not miss this! I will share further updates via my social media.
More information on the CfP can be found on the conference website: https://lnkd.in/dEFJ8dGZ. We also welcome sponsorship from companies; do not hesitate to contact me about it (more information at: https://lnkd.in/dPbu7H27).
Selected papers will be invited to an #MDPI Information Special Issue (https://lnkd.in/dXrM-zhE).
So, what are you waiting for? Plan ahead for your next paper. I hope to see you all soon in Newcastle!!!