I'm analyzing a Apache access log file and I want to find the hit count.
Following code does it.:
ips = df.groupby('IP').size()
ips.sort()
print ips[-10:]
But I want to find the "Referrer" (the 9th column) of the top 10 ips.
How can I do this?
Sample log file line:
112.135.128.20 - [13/May/2013:23:55:04 +0530] "GET /SVRClientWeb/ActionController HTTP/1.1" 302 2 "https://www.example.com/sample" "Mozilla/5.0 (iPhone; CPU iPhone OS 6_1_3 like Mac OS X) AppleWebKit/536.26 (KHTML, like Gecko) Mobile/10B329" GET /SVRClientWeb/ActionController - HTTP/1.1 www.test.com