Hardware, Software & Product Development | Sparx EngineeringHardware, Software & Product Development | Sparx EngineeringHardware, Software & Product Development | Sparx EngineeringHardware, Software & Product Development | Sparx Engineering
  • Home
  • Expertise
    • Software Engineering
    • Electrical Engineering
    • Chemical Products and Services
    • Biomedical Engineering
    • Mechanical Engineering
    • Production Management
    • Automation
    • Industrial Design
  • Blog
  • Careers
  • About Us
NextPrevious

Recovering Images from Google Chrome Browser Cache

By Dheadley | Software | 8 comments | 19 March, 2013 | 0

I learned a cool trick over the weekend – recovering images from the browser cache, specifically from Google Chrome. That may seem a little obscure or unhelpful but believe me, it can be necessary. Right now you’re probably wondering to yourself “but why would anyone need to do that? I don’t really see the purpose here”. Valid question.

The ability to recover images from cache becomes a pretty valuable skill when you COMPLETELY ERASE A WEBSITE WITH NO BACKUP.

Who would do such a thing? Me. On Friday. In the course of testing a WordPress backup plugin I lost our new blog and all of the content – the irony of this is not lost on me. But enough about how we got here, let’s learn something from it.

But before you get too judgmental or start shaking your head, I offer up this link on StackExchange by none other than Jeff Atwood himself:

Unfortunately, our hosting provider experienced 100% data loss, so I’ve lost all content for two hosted blog websites:

  • http://blog.stackoverflow.com
  • http://www.codinghorror.com

(Yes, yes, I absolutely should have done complete offsite backups. Unfortunately, all my backups were on the server itself. So save the lecture; you’re 100% absolutely right, but that doesn’t help me at the moment. Let’s stay focused on the question here!)

Luckily I had a copy of all the text content in a WordPress backup XML, that I had generated from the standard Tools -> Export feature in WordPress. I also had a database backup from following the guidance from WordPress. That covered restoring blog posts, comments, etc., but I was still left without any of the images or galleries.

The options at this point revolve around cache copies, of which there are two main repositories (for me at least): Google’s search engine cache and browser cache.

To see what Google has cached for a specific URL use the following URL:
   http://webcache.googleusercontent.com/search?q=cache:

Which for this blog, would look like this:
http://webcache.googleusercontent.com/search?q=cache:www.sparxeng.com/blog

Unfortunately, Google didn’t have have much content for me to scrape. My last option was browser cache.

To see your cache files in Chrome type the following in the address bar:
    about:cache
or
    chrome://cache/

Google Chrome stores all cached data files in raw form, with the http headers intact. This means that the content you want is there, but you can’t just right click the link and save the file. What you see when you click the link is an HTML report file that is human readable but isn’t the final binary. Example graphic below is from a blog by the guys at Frozax Games:

Chrome Cache HTML Report

Luckily, there is one more free and easy online tool available to convert these HTML report files into the original binary form. The folks at Senseful Solutions have provided a browser-based conversion tool for rebuilding the binary file. You can copy/paste the html page into their web form and the file will magically appear below. Fantastic tool. Thanks guys.

Restore Image from Browser Cache

tl;dr
I erased all our blog images and restored them using browser cache and this link.

Backup your stuff!

backup, cache, Wordpress

Dheadley

More posts by Dheadley

Related Posts

  • Reading line-by-line from a serial port (or other byte-oriented stream)

    By Ben Voigt | 10 comments

    With many .NET developers moving from the traditional (and broken) System.IO.Ports.SerialPort DataReceived event handling to either the correct and more efficient BaseStream.BeginRead / BaseStream.EndRead pair I promoted in my last post or the newer BaseStream.ReadAsyncRead more

  • How to Model NPT Threads in Solidworks

    By rmontifar | 2 comments

    National Pipe Thread Taper or NPT threaded pipes and fittings are deployed in a variety of fields where transportation or containment of liquids, gases, steam, or hydraulic fluid is required. The NPT geometry allows internalRead more

  • Multi-Tiered Linux Backup System – Part I

    By dreynolds | 0 comment

    Backing up important data and memories is an important task that should not be neglected. Just as important as performing Linux backups is verifying that the backups made are good and can be used toRead more

  • Clojure: An improved workflow

    By dfohl | 0 comment

    Like many beginning Clojure programmers, I started off following Stuart Sierra’s “Reloaded” workflow guide. While it was a great starting point, there were a number of things that I wanted to change. If the projectRead more

  • Start Zoneminder Recordings with Vera Events

    By dsmoot | 4 comments

    In a previous post I explained how you could configure the security DVR software Zoneminder to trigger recordings from a network connection. While a neat trick, I never really explained why I set this up.Read more

8 comments

  • lyncastic Reply March 19, 2013 at 10:31 am

    i went through a similar problem before the days of google chrome. Luckily google’s search engine also keeps a pretty decent cache as well that I used for backing up things

  • Shawn Reply September 23, 2013 at 7:14 pm

    if the site has been archived / crawled in anyway you could try the “waybackwhen machine” located at http://archive.org/

  • dodge70 Reply November 11, 2013 at 2:33 pm

    cachecopy is your friend.

  • ferhat223 Reply June 9, 2014 at 2:20 pm

    thank you very very much for this helpful information. I really appreciate it :)))))))))

  • Me Reply January 6, 2016 at 5:28 am

    This is a great infor, I just spent an hour trying to find a cached image in Google after having deleted it from a forum post… Luckily I had it in my Chrome cache…

  • Merty Pat Reply March 22, 2017 at 4:23 pm

    Dont use such a light colored text, with the white in the back it’s really irritating to read.

  • U0 Reply March 24, 2017 at 10:23 am

    Agreed on the light colored text – VERY hard to read.

    Also, the senseful solutions thing no longer works, in case anyone’s wondering.

  • Gabe Reply June 1, 2017 at 1:36 pm

    Thanks so much for this article :D!!! Found an image that I lost thanks to your “webcache.googleusercontent…” tip :)!

Leave a Comment

Cancel reply

Your email address will not be published. Required fields are marked *

NextPrevious
  • Home
  • Expertise
  • Blog
  • Careers
  • About Us
Sparx Technologies, LLC. dba Sparx Engineering © 2009 - 2022 | All Rights Reserved
  • Home
  • Expertise
    • Software Engineering
    • Electrical Engineering
    • Chemical Products and Services
    • Biomedical Engineering
    • Mechanical Engineering
    • Production Management
    • Automation
    • Industrial Design
  • Blog
  • Careers
  • About Us
Hardware, Software & Product Development | Sparx Engineering