i have multi-lines of emails , need couple of things:
stephen.marquard@uct.ac.za louis@media.berkeley.edu zqian@umich.edu rjlowe@iupui.edu zqian@umich.edu rjlowe@iupui.edu ... etc - i need put them in 1 list:
['stephen.marquard@uct.ac.za','louis@media.berkeley.edu','louis@media.berkeley.edu'..etc] need figure out email repetitive within list, that's how startred code , hope complete ended code!
fname = raw_input("enter file name: ") if len(fname) < 1 : fname = "mbox-short.txt" fh = open(fname) lines = [] count = 0 # next step line in fh: line = line.rstrip() if not line.startswith("from ") : continue x = line.split() emails = x[1] #print y maxapperence = 0 famous = none mail in emails: count = emails.count(mail) if count > maxapperence: famous = mail print famous apparence = dict() mail in set(emails): apparence[mail] = emails.count(mail) print apparence]out put :
stephen.marquard@uct.ac.za louis@media.berkeley.edu zqian@umich.edu rjlowe@iupui.edu zqian@umich.edu rjlowe@iupui.edu cwen@iupui.edu cwen@iupui.edu gsilver@umich.edu gsilver@umich.edu zqian@umich.edu gsilver@umich.edu wagnermr@iupui.edu zqian@umich.edu antranig@caret.cam.ac.uk gopal.ramasammycook@gmail.com david.horwitz@uct.ac.za david.horwitz@uct.ac.za david.horwitz@uct.ac.za david.horwitz@uct.ac.za stephen.marquard@uct.ac.za louis@media.berkeley.edu louis@media.berkeley.edu ray@media.berkeley.edu cwen@iupui.edu cwen@iupui.edu cwen@iupui.edu
if you've got file contains email addresses:
import collections filename = '' c = collections.counter(map(str.strip, open(filename).readlines())) print(c.most_common(10)) # dumb example of possible output format
Comments
Post a Comment