Bluesky, AI, and the battle for consent on the open web
Daniel van Strien, a machine learning librarian at Hugging Face, took a million Bluesky posts and turned them into a dataset expressly for training AI models: the dataset could be used for “training and testing language models on social media content, analyzing social media posting patterns, studying conversation structures and