The data of our culture and society are increasingly being digitized and, to some extent, accessible. Actions and ideas are performed and articulated on various digital platforms, our daily life leaves behind long trails of digital footprints, and more and more cultural heritage material are accessible online. On the one hand, researchers face novel opportunities to study culture and society with massive data sets. On the other hand, researchers who scrape data from commercial platforms such as Twitter or from online forums like Flashback, not only tend to face different technical obstacles but also ethical problems. To this breakfast we discuss some important articles that takes an interest in these opportunities and concerns.