Extract email addresses from Webpages

Standard
    1. Extract the list of links from where you intend to extract email addresses from:
      Use a tool like chrome extension like Link Klipper
    2. Assuming that the filenname with the links extracted is urllist.txt, run the below command.

for site in `cat urllist.txt`

do
wget -q -l1 -t1 -T60 -O - $site | grep -E -o "\b[a-zA-Z0-9.-._]+@[a-zA-Z0-9.-]+\.[a-zA-Z0-9.-]+\b" | sort | uniq -i >> emails.txt
done;

you can adjust -l1 flag to determine the depth of your search.

Leave a comment