The Panama Papers - massive data leak revealing widespread corruption

☑︎#VoteDemocrat

The Original
Bushed
WOAT
Supporter
Joined
Dec 9, 2012
Messages
302,264
Reputation
-34,076
Daps
611,754
Reppin
The Deep State

No1

Retired.
Supporter
Joined
Apr 30, 2012
Messages
28,999
Reputation
4,599
Daps
63,699
So is this going to be coming out in streams?
 

☑︎#VoteDemocrat

The Original
Bushed
WOAT
Supporter
Joined
Dec 9, 2012
Messages
302,264
Reputation
-34,076
Daps
611,754
Reppin
The Deep State
God.

Damn.












The data
The Panama Papers include approximately 11.5 million documents – more than the combined total of the Wikileaks Cablegate, Offshore Leaks, Lux Leaks, and Swiss Leaks. The data primarily comprises e-mails, pdf files, photo files, and excerpts of an internal Mossack Fonseca database. It covers a period spanning from the 1970s to the spring of 2016.


_modules_4_image_url.png

Moreover, the journalists crosschecked a large number of documents, including passport copies. About two years ago, a whistleblower had already sold internal Mossack Fonseca data to the German authorities, but the dataset was much older and smaller in scope: while it addressed a few hundred offshore companies, the Panama Papers provide data on some 214,000 companies. In the wake of the data purchase, last year investigators searched the homes and offices of about 100 people. The Commerzbank was also raided. As a consequence of their business dealings with Mossack Fonseca, Commerzbank, HSH Nordbank, and Hypovereinsbank agreed to pay fines of around 20 million euros, respectively. Since then, other countries have also acquired data from the initial smaller leak, among them the United States, the UK, and Iceland.



The system
The leaked data is structured as follows: Mossack Fonseca created a folder for each shell firm. Each folder contains e-mails, contracts, transcripts, and scanned documents. In some instances, there are several thousand pages of documentation. First, the data had to be systematically indexed to make searching through this sea of information possible. To this end, the Süddeutsche Zeitung used Nuix, the same program that international investigators work with. Süddeutsche Zeitung and ICIJuploaded millions of documents onto high-performance computers. They applied optical character recognition (OCR) to transform data into machine-readable and easy to search files. The process turned images – such as scanned IDs and signed contracts – into searchable text. This was an important step: it enabled journalists to comb through as large a portion of the leak as possible using a simple search mask similar to Google.


The journalists compiled lists of important politicians, international criminals, and well-known professional athletes, among others. The digital processing made it possible to then search the leak for the names on these lists. The "party donations scandal" list contained 130 names, and the UN sanctions list more than 600. In just a few minutes, the powerful search algorithm compared the lists with the 11.5 million documents.

_modules_6_image_url.png
 
Top