Jan Domanski

Posts

Showing posts from 2012

Do the quarterly results add up to the yearly ones – sanity check for WIG20

October 02, 2012

Even when it comes to infographics, length can be an important consideration. So here it is: the comparison between reported, audited quarterly report and the yearly reports for the same period. Most of the times the agreement is good (median factor reported is 1.0); however, substantial under- and over-reporting does happen. As to why, I have no clue. To my mind, it doesn't sit right but the again I know nothing about finances. A friend of mine who's better versed in the matter was a little bamboozled by that as well.

How close are the quaterly reported results to the yearly ones (for KGHM)

September 30, 2012

Companies publish their results every quarter. These results are often a good indication how a company will perform in the year (if Q'1 data for 2012 are better than the Q'1 from 2011, that's generally a good sign). The Q'4 report is also published much earlier that the yearly report for a company, at least in Poland the gap can be a few months. So the question is do the combined Q'1-Q'4 results represent in essence the same information, as results published in the yearly report for which one has to wait, could one use the information in the summed Q'1-Q'4 reports to derive at least a good estimate of the yearly results? So here is a data for KGHM Table 1. Income (after tax) for KGHM with quarterly report data summed up for each year (YR and QT are the consolidated yearly and quarterly results; both are given in thousands of PLN) Year YR Q'1-Q'4 (sum) Fractional difference 2001 -838,625 -830,943 0.009 2002 -214,734 -216,299 -0.007

Pushing with python analysis of WIG

September 25, 2012

There is a number of simple questions about the financial statements of the companies listed on the WIG that I'd like to understand. One of these questions is: do the quarterly results (for Q1-4 of a given year) add up to the yearly results (of that year)? Getting the answer is difficult because 1. I don't own the data, 2. processing it is not super-straightforward. Today, I'm happy to report that hurdle 1. is down: scrapping Polish pages related to markets was made very simple by using a simple snippet import re from mechanize import Browser br = Browser() ... html_doc = br.response().read() html_doc = re.sub('content="text/html; charset=iso-8859-2"', "", html_doc) soup = BeautifulSoup(html_doc) Doing the simple 'sed'-like replacement solved all the issues BeautifulSoup had parsing the response I was getting from http://gielda.wp.pl

Python analysis of WIG

August 26, 2012

Two friends told me about their recent stock-trading experience. Having traded myself in high-school, I've had some sentiment and decided to see if I could make more rational decision that (the poor ones, admittedly) I've made then. (Technical diversion in gray) The first problem is that you don't "own" the financial statements data: by own I don't mean some intellectual property right – by own I mean having a file on my HDD with the data (be it a text file, db, hdf5, whatever) that one has the freedom to analyse. Scraping/HTML-parsing modules for Python came to help. Using mechanize, I was able to emulate a browser; BeautifulSoup was good for www.gpw.pl but failed for www.bankier.pl which then yielded to lxml. A simple question would be: is WIG under- or overvalued? To answer, take the P/E and P/BV values for all 400+ companies, bin them and draw a histogram. Fig 1. (A) Price-to-book value (per share) comparison, (B) Price-to-earnings (per s

Electorostatics artifacs in water simulations

August 25, 2012

The major computational cost in MD simulation is the calculation of non-bonded interactions between particles, especially the electrostatic ("minus-plus") interactions. There are several ways in which these interactions can be computed. The simplest one is a distance cut-off: only particles within x of each other will interact. More sophisticated methods, compute a grid onto which the particle charges are discretized for the whole system; the short-range interactions are computed within a cut-off, while the long-range interactions are computed to every grid point. The choice is important: simple models can give advantage in terms of computation speed, while more accurate models can save one from various artifacts. A prominent example of which, is the artificial ordering of water molecules in a simulation, which occurs when using the simple cut-off scheme for electrostatics (not shown). Figure 1, shows that if one inspects two properties of water in simulations genera