Archivists Are Mining Parler Metadata to Pinpoint Crimes at the Capitol

Technologists are using scripts and tools to now pull noteworthy content from the huge Parler dataset.
January 12, 2021, 8:32pm
Capitol location data
Image: Motherboard

Using a massive 56.7-terabyte archive of the far-right social media site Parler that was captured on Sunday, open-source analysts, hobby archivists, and computer scientists are working together to catalog videos and photos that were taken at the attack on the U.S. Capitol last Wednesday.

Over the last few days, Parler was de-platformed by Amazon Web Services, the Google Play Store, and the Apple App Store, which has taken it offline (at least temporarily). But before it disappeared, a small group of archivists made a copy of the overwhelming majority of posts on the site.

Advertisement

While all the data scraped from Parler was publicly available, archiving it allows analysts to extract the EXIF metadata from photos and videos uploaded to the social media site en masse and to examine specific ones that were taken at the insurrection on Capitol Hill. This data includes specific GPS coordinates as well as the date and time the photos were taken. These are now being analyzed in IRC chat channels by a handful of people, some of whom believe crimes can be catalogued and given to the FBI.

"I hope that it can be used to hold people accountable and to prevent more death," donk_enby, the hacker who led the archiving project, told Motherboard on Monday


One technologist took the scraped Parler data, took every file that had GPS coordinates included within it, formatted that information into JSON, and plotted those onto a map. The technologist then shared screenshots of their map with Motherboard, showing Parler posts originating from various countries, and then the United States, and finally in or around the Capitol itself. In other words, they were able to show that Parler users were posting material from the Capitol on the day of the rioting, and can now go back into the rest of the Parler data to retrieve specific material from that time.

They also shared the newly formatted geolocation data with Motherboard. Motherboard granted the technologist anonymity to speak more candidly about a potentially sensitive topic.

capitol_map.jpg

Some of the plotted Parler GPS data. Image: Motherboard

The technologist said that, to at least some extent, since this data shows the use of Parler during the Capitol raid attempt, "that's a piece of the overall puzzle which someone, somewhere can use."

"It's definitely to help facilitate or otherwise create another exposure that the public can consume," they added, explaining their motivations for cleaning the Parler data.

Advertisement

This particular technologist did not distribute their version of the data more widely, however, with the aim of preventing abuse and misuse of the data.

"Sure, the source data are already public. But that doesn't mean I have to add an even easier path to data misuse," they said.

"For this Parler data, it would clearly not be correct to say 'every single user is a Nazi' and so by complete disclosure you are enabling someone who WOULD hold such a narrative to make bad choices and take bad actions if they wished," they added.

Do you know anything else about the Parler data? We'd love to hear from you. Using a non-work phone or computer, you can contact Joseph Cox securely on Signal on +44 20 8133 5190, Wickr on josephcox, OTR chat on jfcox@jabber.ccc.de, or email joseph.cox@vice.com.

Earlier on Tuesday, an analysis of the metadata by Gizmodo also showed that Parler users made it into the Capitol.

Others who have managed to get their hands on the Parler data have begun to make lists of videos and photos that have GPS coordinates on Capitol Hill, and have written scripts to pull those videos from the broader dump so people can analyze them. On an IRC chat channel, a small group of people are watching and analyzing videos and are posting their video IDs and description into a Google spreadsheet called "Notable Parler Videos." One description reads: "at the capital, pushing police, guy in MAGA hat screaming 'I need some violence now.'" A description for the IRC channel includes a link to an FBI tip line specifically targeted at identifying people at the riot.

One open source project calling itself Parler Analysis has collected different tools from around the web to handle the data in different ways. One is used to scrape usernames, for example, while another is for extracting images and videos, and yet another is an alternative cleaned dataset of cleaned Parler geolocation coordinates in a different format.

Subscribe to our cybersecurity podcast CYBER, here.

Advertisement

Motherboard Presents: Humans 2020

We are honoring 20 scientists, engineers, and visionaries who helped make this dark year a little brighter.
December 4, 2020, 2:00pm
​Image: Michelle Urra
Image: Michelle Urra
Honoring scientists, engineers, and visionaries who are changing the world for the better.

2020 has been a year of such intense upheaval and escalating tragedy that it has become a kind of temporal fall guy on which to pin the ills of the world, even if they existed before the pandemic and will continue after. 

Advertisement
face mask Kamenya Omote
All photos courtesy of 

Shuhei Okawara / Kamenya Omote

Culture

This Man is ‘Buying’ Faces to Make Creepily Realistic Masks, and People Want In

Japanese mask shop Kamenya Omote is paying people close to $400 to use their face to 3D-print lifelike masks.
December 1, 2020, 1:35pm

There is something quite creepy about the idea of looking at someone with the same face as you. No one knows for sure how they would react if the situation were to arise. Some people believe if you were cloned, you probably wouldn’t recognise yourself. But Japanese mask shop Kamenya Omote is now giving people the chance to experience this. Not just that, the mask shop is also paying people to “sell” their faces, which would go on to become masks that anyone around the world can buy.

Advertisement
A group of students  stand
Collage by Cathryn Virginia | Photo via Getty Images

Facial Recognition Company Lied to School District About its Racist Tech

Documents reveal Lockport Schools' facial recognition tech has mistaken broom handles for guns and has misidentified Black students at much higher rates.
December 1, 2020, 2:00pm

Ever since they learned that Lockport City School District intended to install a network of facial recognition cameras in its buildings, parents in the upstate New York community—particularly families of color—have worried that the new system will lead to tragic and potentially fatal interactions between their children and police.

Advertisement

Parler Finds Refuge With the Far-Right's Favorite Webhost

Epik, already used by Gab and other far-right associated social media apps, just took on Parler.
January 11, 2021, 9:49pm
GettyImages-1225872885
Amid rising turmoil in social media, recently formed social network Parler is gaining with prominent political conservatives who claim their voices are being silenced by Silicon Valley giants. (Photo by OLIVIER DOULIERY/AFP via Getty Images)

Epik, a Washington State-based internet webhosting safe haven for far-right websites, has given refuge to Parler after it was banned by several major tech companies.

With users as wide-ranging as Texas Senator Ted Cruz and neo-Nazi terror groups, Parler was recently booted from Amazon’s webhosting service, Google Play, and the Apple Store, after it was accused of being associated with the online organization preceding the violence on Capitol Hill last week.

Advertisement