Habrastatistics: analyzing readers' comments. Part 2: answers to questions

Hi Habr.



The previous part analyzed the comments left by users of this site, and it sparked a rather lively discussion about various parameters (number of comments, rating, “karma”, etc.). Enough questions have accumulated to warrant a second part.







If you want to know how long the largest comment discussion of this year was, what the maximum and minimum user “karma” can be, and other statistics, read on below the cut.



Data collection



, , . HTML, «» , . , , .



( ):



https://habr.com/ru/post/322900/,comment_19707920,comment_19706258,UserXXXX,karma:112.2,answers:1,2019-02-04 20:26:00,rating:1,up:1,down:0, ?

https://habr.com/ru/company/mailru/blog/351212/,comment_19794710,comment_19794310,UserXXXX,karma:-10.0,answers:1,2019-02-23 18:16:00,rating:3,up:5,down:-2,

...
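Records in the format shown above can be read back with a few lines of Python. This is a minimal sketch, not the author's code: the names `CommentRecord` and `parse_record` are hypothetical, and the meaning of the second and third fields (comment id and parent comment id) is an assumption based on their order in the sample. The line is split on the first ten commas only, so commas inside the comment text are preserved.

```python
from datetime import datetime
from typing import NamedTuple


class CommentRecord(NamedTuple):
    url: str
    comment_id: str   # assumed: id of the comment itself
    parent_id: str    # assumed: id of the comment being answered
    user: str
    karma: float
    answers: int
    posted: datetime
    rating: int
    up: int
    down: int
    text: str


def _value(field: str) -> str:
    """Return the part after 'key:' in a 'key:value' field."""
    return field.split(":", 1)[1]


def parse_record(line: str) -> CommentRecord:
    # Split on the first 10 commas only; the rest is the comment text.
    parts = line.rstrip("\n").split(",", 10)
    parts += [""] * (11 - len(parts))  # tolerate a missing text field
    return CommentRecord(
        url=parts[0],
        comment_id=parts[1],
        parent_id=parts[2],
        user=parts[3],
        karma=float(_value(parts[4])),
        answers=int(_value(parts[5])),
        posted=datetime.strptime(parts[6], "%Y-%m-%d %H:%M:%S"),
        rating=int(_value(parts[7])),
        up=int(_value(parts[8])),
        down=int(_value(parts[9])),
        text=parts[10],
    )
```

Keeping each record as a named tuple makes later filtering and grouping by `user`, `rating`, or `parent_id` straightforward.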







, answers ( ), «» , .



«» , , :



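def get_karma(user: str) -> float:
    # get_as_str() and find_between() are helper functions that are not shown here
    data_html = get_as_str("https://habr.com/ru/users/%s/" % user)
    # Narrow the page down to the karma counter, then normalize the decimal comma and the typographic minus
    karma = data_html.find_between('info/help/karma/', '</a>').find_between('stacked-counter__value', '/div>').find_between('>', '<').replace(",", ".").replace('–', '-')
    return float(karma) if len(karma) > 0 else 0.0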
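The function above relies on a `find_between` helper that is not shown. The sketch below is one possible stand-alone version, written as a plain function rather than a string method. Its exact behavior is an assumption, `parse_karma` is a hypothetical name, and the HTML fragment in the usage note is made up for illustration (the real page markup may differ). It works on an already downloaded page, so it can be tried without network access.

```python
def find_between(text: str, start: str, end: str) -> str:
    """Return the text between the first `start` and the next `end`.

    Returns an empty string if either marker is missing (assumed behavior).
    """
    i = text.find(start)
    if i < 0:
        return ""
    i += len(start)
    j = text.find(end, i)
    return text[i:j] if j >= 0 else ""


def parse_karma(page_html: str) -> float:
    """Extract the karma value from a user page, or 0.0 if it is absent."""
    block = find_between(page_html, "info/help/karma/", "</a>")
    counter = find_between(block, "stacked-counter__value", "/div>")
    value = find_between(counter, ">", "<").strip()
    # Decimal comma -> point, typographic dash -> ASCII minus.
    value = value.replace(",", ".").replace("–", "-")
    return float(value) if value else 0.0
```

For example, `parse_karma('<a href="/info/help/karma/"><div class="stacked-counter__value">112,2</div></a>')` yields `112.2`, and a page without the karma block yields `0.0`.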

      
      





, , rocket science. HTML .



CSV- 2019 334. . , , . , .





, « » , . , «» , . , , , , .



, 25109 — , . 9973 (39%). 12346 (49%), <=4 5384 (21%), >= 40 ( « » ) 1522 (6%). 2790 (11%).



:







— . , .. (9973 2570 «» 1). «» , , . «» «» ?



Top 10: Zelenyikot (+1509.2), Milfgard (+1471.0), m1rko (+1039.5), PatientZero (+986.0), Boomburum (+881.9), ValdikSS (+873.5), alizar (+837.5), tangro (+802.5), lozga (+764.7), DIHALT (+696.1). Good job, dudes :)



, , — 1000 ( , ), , .



-10 … , «», , . , , , , «» .



-10






.



-5 26, 17, 16, 15 14 .



:







, 100 .





: +218, +144, +141, +133 +124.



: -248, -170, -163, -131 -114.



, «» — , , «» . , .





. — HTML , data-parent_id, , , .
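Since each comment carries a reference to its parent (the `data-parent_id` attribute mentioned above), discussions can be grouped by following those links up to a top-level comment. The sketch below shows one plausible way to do this. The function name `thread_sizes` is hypothetical, and the definition of "discussion size" (all comments under one top-level comment, itself included) is an assumption that may differ from the one behind the figures in this article.

```python
from collections import Counter
from typing import Dict


def thread_sizes(parents: Dict[str, str]) -> Counter:
    """Count comments per thread, keyed by the top-level comment id.

    `parents` maps comment id -> parent id. A parent id that is not itself
    a key (for example "0") marks a top-level comment. The parent links are
    assumed to contain no cycles.
    """
    root_cache: Dict[str, str] = {}

    def root_of(cid: str) -> str:
        path = []
        # Walk up until a known root or a top-level comment is reached.
        while cid not in root_cache and parents.get(cid) in parents:
            path.append(cid)
            cid = parents[cid]
        root = root_cache.get(cid, cid)
        for node in path:
            root_cache[node] = root  # remember the result for later lookups
        return root

    return Counter(root_of(cid) for cid in parents)
```

Calling `thread_sizes(parents).most_common(5)` then lists the five largest threads together with their sizes.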



, 2019 : 619 , 618 , 614 , 556 553 . , 4 5 , , -20.



:







— . (41% 183000) , 75 1 , .





«» — 2019 , - .



, . - — , .


