Found a great Python code snippet for listing the most popular URL from Apache / NGINX ‘access.log’ file. Very practical!
import collections logfile = open("access.log", "r") clean_log=[] for line in logfile: try: # copy the URLS to an empty list. # We get the part between GET and HTTP clean_log.append(line[line.index("GET")+4:line.index("HTTP")]) except: pass counter = collections.Counter(clean_log) # get the Top 50 most popular URLs for count in counter.most_common(50): print(str(count[1]) + "\t" + str(count[0])) logfile.close()
The code is very handy if you want to find out the most popular URL or pages in your website, crucial information for optimization, IMHO.