Jason mraz dating tristan Free sexy chat with boys
In this case, the Twitter profiles of the authors are available, but these consist of freeform text rather than fixed information fields.
In the following sections, we first present some previous work on gender recognition (Section 2). Currently the field is getting an impulse for further development now that vast data sets of user generated data is becoming available. (2012) show that authorship recognition is also possible (to some degree) if the number of candidate authors is as high as 100,000 (as compared to the usually less than ten in traditional studies).
With only token unigrams, the recognition accuracy was 80.5%, while using all features together increased this only slightly to 80.6%. (2014) examined about 9 million tweets by 14,000 Twitter users tweeting in American English.
They used lexical features, and present a very good breakdown of various word types.
For gender, the system checks the profile for about 150 common male and 150 common female first names, as well as for gender related words, such as father, mother, wife and husband.
If no cue is found in a user s profile, no gender is assigned.