Are Pornhub statistics useful for social scientists?

I went on a little diatribe in another comment defending the basic descriptive stats that came out as not being obviously wrong, but I'm going to answer your question with a not very. It depends a lot on what data they would or could share.

The most obvious things that they definitely can do is tell us 1. How much Pornhub porn do people watch? 2. Which and which kinds of Pornhub porn videos are being watched?
3. What Pornhub porn is being highly rated? They could do some things to clean up and organize this data that would be more useful than what appears on the site. What good would it be? Basically, it would be good for content analyses of pornography. This is already very possible, though.

If they have and could share detailed individual-level histories with temporal information (deidentifed of course) those could be used to study quite a few questions about pornography consumption -- people might easily study addiction if they had that kind of data, for instance. Think about if that's the data you had-- what could you do with it? Quite a bit more than content analysis but you can't connect that behavior to much externally. It's unclear whether they have such data and whether they could/would share it. If they do have it, they have obvious reasons not to want people to be thinking about it.

How could it go beyond? What people could really learn a lot from is connecting individual level behavior on the site to other things-- e.g. to personality or demographic characteristics, to other sexual behaviors, to all kinds of things. They are both less likely to have this kind of data (info about the user independent of the site) and are going to be even less inclined to make it known they have it. But if they have it and share it, depending what exactly it is, that is what would truly be useful.

I wrote this because I had basically already written a memo like this for my boss because we have a shitload of GoogleAds data (which they gave us) but if it's all aggregate level stuff that I can't connect to individual users I can't do much that would be publishable. If I can see individual-level stuff, I can do a lot more. That would be enough-- but that is also limited if I don't know anything about those people. If I had that too, it would be a goldmine.

/r/AskSocialScience Thread