RSS Feeds

domenica 13 marzo 2011

[Idea at proof stage] Using Facebook for Correlational Research

First of all I have to admit that I'm a very newbie in statistical science (let's say I know one or two things) and that all of the ideas wrote here are just speculation coming from my mind.

I'm quite obsessed in finding patterns in things.
I noticed that people looking alike have some similar behavior, same physical features.
In genetics with the term linkage we indicate the tendency of certain alleles to be inherited together because they are physically close to one another of because it's important for some reason that they don't assort independently.
Linkage is often used to discover genes related to diseases.
In my opinion we can extend its definition and use it not only for the purpose I've just mentioned above but it could be useful also for discovering protective factors for diseases.

My idea is to see correlation between people's medical history and their:

  • habits
  • sexual life
  • work
  • sex
  • age
  • family history
  • social condition
and for this purpose I want to use the largest database in the world of human beings: Facebook.

Since some months I have been thinking about how many data we put into social networks spontaneously and how this data, with relation to our connections (friends, parents and relatives), could form pattern that not a human being but a computer can understand and extrapolate (it's called Data Mining and I just discovered this today).

I still don't know how to sort this out, by now I plan to develop a survey Facebook application.
But with this approach there are some problems:
  1. Noise: people can answer questions with false information
  2. Appealing: people wouldn't reply questions for no reason. I have to make the application appealing and develop a game around it (e.g. Find your partner, Who looks like you, ecc...) 
  3. Harshness: Facebook it's just for fun and of course I can't ask: are you obese? are you rich? Questions have to be asked smoothly and indirectly: Do you like eating? What mobile phone have you got?
The application would also continuously monitor some information from users:
  • from status update it can get information about an user habits, what he eats, sports he practices (why not, even sexual habits)
  • from new options of geolocalization it can check places he attends (e.g. where he spends much of his time that can be a gym or a McDonalds)
  • it can even understand if he spends too much time on facebook from pc (so he has a sedentary life-style) checking how long is he online (and assessing he is active of course)
Everything about this idea is still a jumble in my mind and I hope to sort this out soon and whoever wants to be involved has just to comment this post.
Follow me on twitter for updates
By now I have the support from Moreno Colaiacovo (@emmecola) from MyGenomix who gave me some useful advices and I hope more people will join this project.

3 commenti:

Giovanni ha detto...

Hi,
it is an interesting project, but unfortunately it will be difficult to access the whole Facebook data.

I have heard saying that on the 23AndMeCommunity, some people are doing something similar, using their genotype data as well. Maybe you should have a look at the 23AndMe forums..

Alessandro Ferrari ha detto...

really?!?
sometimes ago I sent an email to 23andMe having no reply...
this was my email:

-----
Hello, my name is Alessandro Ferrari and I'm enrolled at MSc course in Medical Biotechnology at University of Bari.
I'm @mrgorefest on twitter and I'm sending this email in reference to this tweet.

Since some months I have been thinking about how many data we put into social networks spontaneously and how this data, with relation to our connections (friends, parents and relatives), could form pattern that not a human being but a computer can understand and extrapolate.
My idea is this: create a facebook application available to your customers who used your kits (whose genotype are in your database) which will continuously monitor some information from users:

* from status update you can get information about an user habits, what he eats, sports he practices (why not, even sexual habits)
* from new options of geolocalization you can check places he attends (e.g. where he spends much of his time that can be a gym or a McDonalds)
* you can even understand if he spends too much time on facebook from pc (so he has a sedentary life-style) checking how long is he online (and assessing he is active of course)

These are just some ideas of what you can take from the user.
Of course you will have a big background noise but you can have help from Google which knows this kind of problems.
Maybe my idea is just näive and just by a student but I think that to find correlations in some disease, social networks will play an important role because they can tell you so much about environmental factors which can be linked to disease. Considering that you already have the customer's genotype you already are a step further.
Let me know what you think about this idea, even if it's silly.
Thank you
Sincerely,

Alessandro Ferrari
-----

have you any link to the topic you are talking about?

Alessandro Ferrari ha detto...

p.s. yes, it's difficult accessing all facebook data and indeed that's not the way I want to do it. My idea is to develop an application who can access data relating to person that installs it. In this way you can access whatever information which the person allows you to.

Posta un commento