WGET

June 4, 2020

WGET, an indispensable tool for working with the web. Below are a few examples extracted from my CLI cheat-sheet, with explanation on syntax.

WGET & CURL: equivalent examples

wget -O index.html www.exampledomain.com

# The -O (upper case) is optional, and if omitted it usually saves as the index.html for the website.
curl -L www.exampledomain.com > index.html

# The -L follows all redirects before returning data.
# Curl normally spits out to the CLI, but the '>' redirects the named file.  Beware '>' overwrites, sometimes useful is '>>' to append.

Debugging

Sometimes it's really useful to be able to grab a website in its entirety.

wget -pHk www.exampledomain.com

-p  fetches all accompanying assets (images, css, js) to view the site
-H  enables recursive run, to fetch assets from other hosts
-k  after downloading, this will change all asset links to local/relative

Extra

du -sh $pwd

After a wget -pHk (in an empty directory), use du -sh $pwd to see the size of your website. I've found this to be a good statistic to keep track of for UX/mobile purposes. Though there's a lot to consider whether it's CSS, JS, or other and a large website doesn't necessarily mean it's slow.

· CLI