Study of the Biggest Leak in History




Introduction to the Panama Papers

Panama Papers involved hundred thousands of offshore accounts. Millions of documents detailing top secret financial information were leaked in 2015 by an anonymous source. It is estimated that tax havens cost poor countries at least $170 billion in lost tax revenues each year making poorest people lose out and the inequality gap grow. Holding money in an offshore company is generally not illegal but it can be used to launder money, dodge sanctions, avoid taxes. Many people listed in the panama papers claimed being innocent.

Some relevant numbers related to Panama papers

  • 11.5 millions leaked documents that detail financial and attorney–client informations
  • More than 214,488 offshore entities involved.
  • Some 4.8 millions leaked files were emails, 3 millions were database entries, 2.2 millions PDFs, 1.2 millions images, 320,000 text files, and 2242 files in other formats.
  • 2.6 terabyte trove of data at the core of this investigation contains nearly 40 years of records.

A word about the dataset before starting

Unfortunately, we don’t have access to big part of the data and metadata from the original leaks. So, we decided to enrich our data with other similar datasets found in The International Consortium of Investigative Journalists (ICIJ): Paradise papers, Offshore leaks and Bahamas leaks. The pie chart below explains more the sources of the data we used in our analysis:

entities plot entities

We combined all the datasets together to get more information, and then structured them in 3 relevant datasets. Each one represents a specific role of involved parties:

  • Entities: is a company, trust or fund created in a low tax offshore jurisdiction.
  • Officers: a person or a company who plays role in an offshore entity.
  • Intermediaries: the link between someone seeking an offshore and an offshore service provider.

An other important dataset was used in our analysis which is :

  • Edges: they connect nodes from the parts above and describe the nature of the relation between them. This dataset was very important especially in network analysis.

Outline of the project

There is a huge amount of data. Our work was to provide a detailed analysis of the dataset. We didn’t focus our analysis on specific individuals. We tried instead to get a closer view about the involved countries and the distribution of the papers around the world.We will show you a brief presentation of the involved countries and the connection between them. Finally, we looked for any correlation between the implication of each country in the panama papers with international indexes. We will try to explain the correlation found between Financial Secrecy Index (FSI) and the countries.

Research questions

  • What is the distribution of officers, entities and intermediaries around the world? Is it based in specific countries or is it overspread?
  • Based on network analysis, how can we describe the relationship and the connection between those countries?
  • Can we relate our rankings to other world known indicators such as Financial Secrecy Index (FSI)?

We will try to explain the phenomenon by analyzing the datasets in order to understand how much tax evasion is a very important tool for rich people. Our role is not to judge people or jurisdictions. We are just analyzing and interpreting the fact and maybe pushing towards the implementation of new tax regulations to prevent such aberrance.


Involved countries

The worldwide distribution of the three main kinds of actors (which are Offshore entities, intermediaries and officers) is an important thing to analyze. A general idea about the involved countries may give us a general idea about Panama papers. We noticed after the analysis that the three of them differ from each other on a global scale even though some countries are “well” represented in all three categories.

Entities

entities plot entities

Regarding the offshore entities, the presence of economic global powers is clear : US and Russia. But there are other efficient hosting countries. Indeed, in Europe, Switzerland and Malta are leading with a very huge number comparing to other countries followed by Great Britain and Luxembourg. In South America, Panama has a large capital of offshore entities. In western Asia, United Arab Emirates is also leading. It is noticeable that a lot of top countries involved belongs to offshore financial centers according to FMI, which is very logic. Companies look for low tax offshore jurisdiction to maximize their benefits.

Intermediaries

entities plot intermediaries

To fulfill a tax evasion, the offshore companies also need a go-between with a service provider, these intermediaries usually are law-firms, banks or middlemen that ask an offshore service provider to create an offshore firm for a client. The Intermediaries are once again most located in Hong Kong, Great Britain and Switzerland , as they are known for the privacy of their bank services. Still, the distribution of the intermediaries is very similar to the offshore entities distribution on a global scale. The intermediaries involve mainly in the phases of the identified decision-making cycle (advice, creation, maintenance, enforcement).

Active intermediaries

entities plot entities

Intermediaries play an important role in decision making, which involves more people and juridictions related to Panama papers. Mossack Fonseca worked with intermediaries in more than 100 countries all over the globe. The most active clients were from the United Kingdom, Hong Kong and Switzerland.

Officers

entities plot officers

Regarding the officers, It is noticeable to say that most countries involved in entities are present here, which is very logic. But, other countries appeared here, especially Germany, Italy and Russia. It means that officers may have a relationship with companies in those countries. We will explain that in the next part.

40 years of offshore companies opening and closing

entities plot entities
  • Incorporation year: the date when an offshore entity was created.
  • Inactivation year: the date when a client told the agent to deactivate the offshore entity which could be recreated at a later date.

The leaked files from Panama-based law firm Mossack Fonseca contain nearly 40 years of data. Mossack Fonseca’s clients have been rapidly deactivating companies since 2005. The number of incorporations of offshore entities has been decreasing rapidly since 2003.

To conclude, countries involved in Panama papers are around the world. The offshore entities are not related to particular countries. Companies, businessmen, football players and even politicians all around the world had an issue with Panama papers. Here is a map that summarizes the distributions of countries:

PS: We chose to present this map in a log scale to be more significant because the gap is very large between the countries.


Network

While it’s true that investigating simple relationships in data could be handled by a relational database, they’re not an especially satisfactory fit, as they represent data as tables, not networks. The ICIJ organization has been using graph database technology to draw links between members of the tax avoiding global elite and bank account held in offshore tax haven. A network analysis will be held to understand these structures and discover connections in a very intuitive way.

In order to get valuable information from the network we constructed, we tried to use some algorithms to understand the importance of nodes according to their characteristics.

Importance of nodes

  • Degree centrality: This algorithm puts into light the countries that have the biggest number of connections. This number can be interpreted in terms of the immediate risk of a country being involved in questionable business.

  • Page rank: This algorithm computes the ranking of the countries in the graph based on the structure of the incoming links. A country is important if it is pointed by other important countries

  • Betweeness centrality: The betweenness centrality for each country is the number of the shortest paths that pass through the country. High betweenness countries are often critical to collaboration across different groups.

Those metrics were the tool that we used to determine the top three most important countries in the network: United Kingdom, Switzerland and Hong Kong.

This map represents the distribution of degree centralities of the nodes (countries). It shows us the importance of European countries in the leaks. There is no surprise here since the European Parliament’s Panama Papers Committee of Inquiry was heavily accusing European governments to have been complicit with money launderers and tax evaders for over 20 years.

Importance of edges

Another interesting result was obtained by the use of the edge betweenness centrality. One could think that the most important edges would be between the countries mentioned above. But, instead of that, the most important ones where between first world countries and third world countries. The most important edge in our network was the one between Switzerland and Mali.

Since switzerland seems to be an important player in this network, the following map shows the connections of the entities based in switzerland and other world countries:

entities plot entities

The result is not surprising again since the paper revealed that 1,339 Swiss lawyers, financial advisors and other middlemen had set up more than 38,000 offshore entities over the past 40 years.

As a consequence, the inter-governmental Financial Action Task Force (FATF) called on the Swiss to do more to thwart financial crime. However, the EU also has been pushing a new Anti-Tax Avoidance Package. It is now introducing a new rule that requires the largest companies operating in the EU to disclose key information on where they make their profits and where they pay their tax in the EU on a country-by-country basis.

In order to get a more intuitions about the graph connections and relation we decided that we will create a weighted graph from countries where each link represent the number entities linked to countries:

To make our graph interactive and highly sophisticated we decided to use Graphistry. This library enables visual investigation of the graph, by putting the mouse on the node or the edge we will see all there attributes.

It also allow us to extract a lot of mathematical relations such as histograms (degree of nodes, edges in, edges out …).

This graph represents the weighted projection of the bipartite graph, here we take into account the number of commun associations between countries as weights. It is clear that again some countries dominate the graph such as Switzerland, Hong Kong, Singapore ..


Financial Secrecy Index (FSI) and involved jurisdictions

Our analysis has not stopped in just analyzing the involved countries and the connection between them. We also wanted to find any correlation between the implication of each country in Panama papers with official indexes published by world wide organizations. We studied the correlation between Financial Secrecy Index (FSI) and countries involved in Panama papers.

The Financial Secrecy Index (FSI) ranks jurisdictions according to their secrecy and the scale of their offshore financial activities. It is a tool for understanding global financial secrecy, tax havens or secrecy jurisdictions, and illicit financial flows or capital flight.

We studied the correlation between the ranking of countries according to the financial secrecy and the ranking of involved countries in Panama papers. The results were relevant: we found correlations between 0.6 and 0.7. That means that the countries having a good ranking regarding Financial Secrecy Index (FSI) attract more offshore entities, which can be explained :

A company or private individual may create an offshore in countries where, preferably, regulations are weak, for example, no need to name the owner. These companies are usually limited liability companies and don’t conduct any business, they just own financial assets of their owner. In fact, this is the main point of Panama papers involved officers. Those countries are basically offshore center that provides financial services to nonresidents on a scale that is incommensurate with the size and the financing of its domestic economy. Secrecy jurisdictions use secrecy to attract illicit and illegitimate or abusive financial flows.


Conclusion

The Panama Papers expose the internal operations of one of the world’s leading firms in incorporation of offshore entities. Politicians, football players and busniessmen all around the world were involved. Even if the offshore entities are legal, politicians and people in charge should revise this phenomon because corruption and money laundring can be very easy in a such environment. In fact,corruption and inequality feed off each other. They create a vicious circle between corruption, unequal distribution of power in society, and unequal distribution of wealth. As showed in the Panama Papers, it is still too easy for the rich and powerful people to exploit the loophole of the global financial system to enrich themselves at the expense of the public good.


Full project

The full project is available on a github repository. It provides 2 Jupyter Notebook. A first one for cleaning the data from the different datasets downloaded from The International Consortium of Investigative Journalists.The second one is the main project where we analysed the data, did the network analysis and studied the correlations. If you want more information about our project, please contact us via email (you can find them in About section)