Archive.is blog

Blog of http://archive.is/ project
  • ask me anything
  • rss
  • archive
  • What happened to the links to Facebook and such?
    Anonymous

    Please provide more details about the problem. Nothing was changed.

    • 1 hour ago
  • Can't save from NYTimes, wget doesn't "push" the button for the full article. Got a fix for that?
    Anonymous

    Can you tell me the exact URL with this bug? I am unable to find it

    • 1 hour ago
  • The new Google Maps interface displays a big dialog in the middle of the archive. Could you please update your archiver to close the dialog?
    Anonymous

    Ok, fixed.

    Thank you for the bug report!

    • 3 days ago
  • Why did you change the URL back from archive-today to archive-is? Archive-today is a better name and URL.
    Anonymous

    I see two problems with .today: it is longer and the URLs are not clickable sometimes:

    image

    If there would be many users who prefer .today, I will change it back.

    • 1 week ago
  • Archives of trademark case details pages from the UK Intellectual Property Office are faulty. They show blank. It would be great for this to be fixed.
    Anonymous

    url?

    • 1 week ago
  • Would you consider handing over all the captured data to the Internet Archive, if you were to close down? They would likely make it publicly available.
    Anonymous

    Not sure if they will make it publicly available, looking at how they handle robots.txt (for instance, compare https://archive.is/www.chillingeffects.org and http://web.archive.org/web/*/www.chillingeffects.org).

    And I am not closing down.

    • 2 weeks ago
  • Are archive-today, archive-is and archive-li (replacing the dash with a dot) the only domains used?
    Anonymous

    yes

    • 2 weeks ago
    • #domain
  • Some pages like this one, 6TROi, are framed such that they have horizontal scrollbars that make it hard to read the page. Is there anyway to capture the page, or have it presented so that the horizontal scrollbar is not present?
    Anonymous

    I took new snapshot of this page (https://archive.is/F3Jxu) and it fits well with no horizontal scrollbar.

    • 2 weeks ago
  • Can you clarify how I get a page archived with text selected? So for instance on KdxNr I had Seattle selected when I used your bookmarklet, but it certain didn't get captured with Seattle selected. How do I add on the selections after a page has been captured?
    Anonymous

    http://blog.archive.is/post/95383347266/hi-i-use-firefox-17-linux-and-select-some-text

    • 2 weeks ago
  • I had an idea for a useful feature that could be added when "download zip" feature is used. I am running into an issue where zip files downloaded from archive-today are not trusted as they can be tampered with, but sites on archive-today itself are trusted. Would it be possible for archive-today to sign the archive with a gpg key when it is generated, and provide the user with the sig in addition to the zip? Then the integrity of the archive could be verified by checking it with your public key.
    Anonymous

    It is not easy due to the paradoxical fact that the snapshots are not stable enough. They are changing over time with the changes in the post-processing code.

    If you compare https://archive.is/Ho3nb/image and https://archive.is/Ho3nb you may notice that the former has the popup with warning about the cookies but the latter is not.

    At https://archive.is/Ho3nb you may notice that the images embedded in tweets (which are cropped on twitter.com until clicked) have their full height on the snapshot. As the snapshots are not interactive we have to make little changes in the webpage layout in order to make semi-hidden content visible without requiring action.

    One more example: at https://archive.is/53wIp the transcript is shown as though “READ TRANSCRIPT” button has been clicked.

    All this magic is performed by the post-processing code. What you see and what you can download in .zip-files are the output of the code. Every change in the code  would change the content and the control sum of all the .zip-files and would require all the .zip-files to be signed again.

    • 2 weeks ago
Next page
  • Page 1 / 17