I am a data and AI entrepreneur and innovator with a long track record of working with the cultural sector and creative industries. As the founder of Reprex, a for-profit impact startup aligned with selected UN Sustainable Development Goals, I work with open data, open-source software, and open platforms. These platforms, such as the Open Music Observatory or OpenCollections, are designed to enable public–private partnerships that help independent, niche, and minority cultural content remain visible and fairly remunerated on global platforms. You can read more about my professional work and related scientific R&D activities on the Reprex website.
Throughout my life, I have maintained a strong interest in various forms of art. Coming from a family of musicians and photographers, I work professionally with music and have been actively involved in photography since the age of eight. I can create and restore both archaic and modern analogue photographs, as well as digital images and scans; I also work with locating photographs and collections, and interpreting their history and production. I enjoy reading and writing in my mother tongue, Hungarian. Thanks to excellent teachers and editors, I have published essays at a high level that I unfortunately cannot reproduce in English, the language I use most days. This attachment to language connects me to Finno-Ugric communities, who are kindred spirits using expressive and, to the outside world, somewhat hidden languages. This personal website is mainly devoted to these interests.
Chartered Financial Analyst, 2015
CFA Institute
M.Sc. Economic Regulation and Competition Policy, 2002
City University
M.Sc. Economics (Actuary Science & Applied Operational Research), 2001
Budapest University of Economics Sciences
The Finno-Ugric Data Sharing Space is a research-driven data sharing space for reconnecting fragmented cultural heritage across languages, institutions, and countries.

Ex ante and ex post grant evaluation

Fortepan is a privately run photography archive that has become a key institution of Hungary’s visual memory. As a long-time contributor, I played a central role in identifying and recovering the long-lost Főfotó negative archive—one of the largest photographic collections of socialist-era Hungary. Through research, provenance clarification, and collaboration with Fortepan and public archives, approximately 400,000 negatives were reintegrated into the public domain, demonstrating how dispersed photographic heritage can be reconstructed and reconnected to institutional memory.

Wikimuseum is an experimental curatorial concept developed with Wikimedia Eesti and Wikimedia Hungary, creating multilingual, co-curated exhibitions across Wikimedia Commons, Wikidata, and Wikipedia. It brings together dispersed cultural heritage—often never seen together physically—into structured digital exhibitions that connect institutional collections, private archives, and community knowledge. The project explores how open platforms can support collaboration between museums, researchers, and source communities, while addressing legal, ethical, and data governance challenges.

TextileBase helps scholars, museums, and businesses connect, share, and reuse data about textiles and dress history.

Connecting local bands with local fans, joining scenes across the globe.

Connecting local bands with local fans, joining scenes across the globe.

We want reduce data inequalities within Western and Eastern, Northern and Southern Europe, and contribute to a transparent data observatory that is inclusive for all.
Demo Music Observatory highlights from our blog
For our daily blog, please visit Data & Lyrics
My hobby is analogue photography. I have been making photos since my childhood. I develop and print my black and white photos. My collection of analogue cameras is ranging from half-frame cameras that use a film size of 18x24 mm to early 20th century large format 9x12 cm folders. I almost always carry a camera that is older than I, on working trips usually a tiny half-frame, and on vacation a large-format folding and several other cameras.

I am always happy to meet friends, fellow colleagues or collectors.
Find me on instagram or on flickr.

Our report highlights some important lessons. First, we show that in the era of global music sales platforms it is impossible to understand the economics of music streaming without international data harmonization and advanced surveying and sampling. Paradoxically, without careful adjustments for accruals, market shares in jurisdictions, and disaggregation of price and volume changes, the British industry cannot analyze its own economics because of its high level of integration to the global music economy. Furthermore, the replacement of former public performances, mechanical licensing, and private copying remunerations (which has been available for British rightsholders in their European markets for decades) with less valuable streaming licenses has left many rightsholders poorer. Making adjustments on the distribution system without modifying the definition of equitable remuneration rights or the pro-rata distribution scheme of streaming platforms opens up many conflicts while solving not enough fundamental problems. Therefore, we suggest participation in international data harmonization and policy coordination to help regain the historical value of music.

Our paper argues that fair competition in music streaming is restricted by the nature of the remuneration arrangements between creators and the streaming platforms, the role of playlists, and the strong negotiating power of the major labels. It concludes that urgent consideration should be given to a user-centric payment system, as well as greater transparency of the factors underpinning playlist creation and of negotiated agreements.

While the US have already taken steps to provide an integrated data space for music as of 1 January 2021, the EU is facing major obstacles not only in the field of music but also in other creative industry sectors. Weighing costs and benefits, there can be little doubt that new data improvement initiatives and sufficient investment in a better copyright data infrastructure should play a central role in EU copyright policy. A trade-off between data harmonisation and interoperability on the one hand, and transparency and accountability of content recommender systems on the other, could pave the way for successful new initiatives.

The topic of the paper is Library Genesis (LG), the biggest piratical scholarly library on the internet, which provides copyright infringing access to more than 2.5 million scientific monographs, edited volumes, and textbooks. The paper uses advanced statistical methods to explain why researchers around the globe use copyright infringing knowledge resources. The analysis is based on a huge usage dataset from LG, as well as data from the World Bank, Eurostat, and Eurobarometer, to identify the role of macroeconomic factors, such as R&D and higher education spending, GDP, researcher density in scholarly copyright infringing activities.

The goal of retroharmonize is to allow the organization of data joins or panels from various data sources, particularly survey microdata files, by retrospective harmonization the value codes, the value labels, and the missing value ranges of the data in a reproducible manner with the help of comprehensive s3 classes.

The paper analyzes a set of weblogs of one of the Library Genesis mirrors, provided to us by one of the administrators of the service. The weblogs contain records of individual book downloads from the period between September 2014 and March 2015. We use the date, the book identifier, and the geo-coordinates of the downloader included in the dataset to reconstruct the global black-market demand for scholarly literature. We then proceed to build a model to explain this traffic with various macroeconomic indicators on the global stage, and with economic, educational, R&D, and other cultural consumption indicators on NUTS-2 level in the European Union.

The goal of eurobarometer is converting Eurobarometer microdata files, as stored by GESIS, into tidy R data frames and help common pre-processing problems.

An open source R package for validating sub-national statistical typologies, re-coding across standard typologies of sub-national statistics, and making valid aggregate level imputation, re-aggregation, re-weighting and projection down to lower hierarchical levels to create meaningful data panels and time series.

The results of the first Hungarian, Slovak, Croatian and Czech music industry reports are compared with Armenian, Austrian, Bulgarian, Lithuanian, Serbian and Slovenian data and findings.

This study argues that the cultural and welfare benefits of this private copying regime are enormous and important to create a good quality of life in Croatia for all age groups, but especially for young people, and it must be maintained. Furthermore, it is very advantageous for the tech sector, because their products are mainly used with unlicensed music and film copies, given that only a very small portion of the population pays for downloads, or subscribes to services like Spotify, Deezer or Netflix. The first measurement of licensed use of music, audiovisual content, home copying and value transfer to media platforms in Croatia for a practical update of the private copying remuneration in the country.

Slovakia’s first music industry report. Following the three income streams model from creation till audience, we summarized for the the number of works that were created, recorded, staged in Slovakia in a year. We calculated their revenues, their value added, their employment effect and the investments of the recording industry. There is an extensive business development and policy conclusions chapter in the 227-pages report, which follows a similar Hungarian report.

Being visible in the world is always difficult in the Central and Eastern European region. Made in Hungary is the first book in the Popular Music Studies series of Routledge from the region. A description of our first datasets, the motivation of the research and the CEEMID concept is laid out as a closing, quantitative chapter in the book.

The goal of retroharmonize is to allow the organization of data joins or panels from various data sources, particularly survey microdata files, by retrospective harmonization the value codes, the value labels, and the missing value ranges of the data in a reproducible manner with the help of comprehensive s3 classes.

The goal of eurobarometer is converting Eurobarometer microdata files, as stored by GESIS, into tidy R data frames and help common pre-processing problems.

An open source R package for validating sub-national statistical typologies, re-coding across standard typologies of sub-national statistics, and making valid aggregate level imputation, re-aggregation, re-weighting and projection down to lower hierarchical levels to create meaningful data panels and time series.