Category Archives: Bachelor Thesis Stefan Kasberger

Who are the Inhabitants of the World Wide Web?

At our weekly Data Science team meeting, a few weeks ago we watched the talk “Everyday Life in a Data Rich World” of Jon Kleinberg, where he lectured about the evolution and the structure of social relations in big scale networks.


[zotpressInText item=”{7KGRGQUT}”]

What caught my attention most in this excellent talk, was a quote used by him right at the beginning:

“The emergence of cyberspace and the World Wide Web is like the discovery of a new continent.”
— Jim Gray, 1998 Turing Award [zotpressInText item=”{J3DKC5SH}”]

This metaphor stimulates my thinking till today and from now and then, new questions arise out of it. Following the metaphor of “the WWW as a new continent”:

  1. Who are the people living on it?
  2. Where do they come from?
  3. Why are they there?
  4. Who does this new continent attract most and who is excluded from living on it?
  5. What are they doing there?
  6. And finally: What are the interests of science in all this?

Right now, I don’t want to get too deep into discussing it, but some things already seem quite obvious:

  • This is historically the first time scientists get access to such huge amount of data about an global “settlement” activity.
  • As separated this continent is on one side, as connected it is from another point of view.
  • It attracts and excludes some people more than others.
  • The inhabitants of the continent are not representative to the society in general.

But anyway, watch the talk and make your own thoughts (would love to hear about them in the comments section).

References

[zotpressInTextBib style=”apa” sort=”ASC”]

Reflections on the Computational Social Science Wintersymposium

On 1st of December the first Computational Social Science Wintersymposium took place at GESIS in Cologne. Here some of my impressions about my second scientific conference.

The topic of the CSS Wintersymposium was “Understanding social systems via computational approaches and new kinds of data”. The day was well visited with about 115 participants – one third from Computer Science, another third from Social Science and the rest from all kinds of disciplines. Funny-wise, I even met two Environmental System Scientists from Osnabrück, something that is not very likely in this relatively new discipline with just a few faculties teaching it.

The conference was introduced by Markus Strohmaier, Head of the Computational Social Science Department here at GESIS and my advisor. During the day several presentations, a Pecha-Kucha and a poster session followed, before it ended in a casual evening in a typical Cologne-style brewery. But let’s take it step by step and get a little bit into the talks which inspired me most (all of them were recorded, as you can see in the playlist below).

Dirk Brockmann: The hidden geometry of complex, network-driven contagion phenomena

Dirk Brockmann from the Research on Complex Systems Department at the Humboldt University Berlin, presented results from his recent work. One of the most impressive work showed was his network analysis about the 2014 Ebola Outbreak [zotpressInText item=”{EAPXQN5N}”], which will stay in my mind for some time. It showed a very useful case, where science delivered desperately needed knowledge to solve real world problems with a huge impact.

Diseases were for me not new, they are a very common usage of Systems Science. In Graz, teachers often used it to show dynamic and complex behaviour of interdependent systems in mathematics, next to the usual predator-prey-models. But mostly it were about formal theoretical thinking or solving hypothetical or long-gone problems, but rarely about an issue that is popping up on my daily news-stream and to which I feel directly related. This experience of connecting my small world of scientific thinking with a global issue was a very inspiring experience and again brought meaning into my thinking about “what do I want to do as a researcher?”. I’m pretty sure, this experience will pop up over and over again to evaluate the relevance of my activities in the future, no matter if in- or outside of science.

Ciro Cattuto: High-resolution social networks from wearable devices

Ciro Cattuto gave a very lively talk about different works of him and his team. As an Open Science enthusiast the SocioPatterns project caught my attention:

…the collaboration supports the development of the SocioPatterns sensing platform, which uses wireless wearable sensors to gather longitudinal data on human mobility and face-to-face proximity in real-world environments. The SocioPatterns team also works on developing tools and techniques to represent, analyze and visualize the collected data.
— SocioPatterns.org

In the study Gender homophily from spatial behavior in a primary school: a sociometric study [zotpressInText item=”{GNGUZC6T}”] this type of data-collection was used to follow children (6–12 years) in a French primary school and measured how long they communicated to each other. The study showed some interesting differences between girls and boys in their development of gender homophily in the early age.

Most of the hardware and software developed for SocioPatterns.org are accessible, but unfortunately they use the Creative Commons BY-NC-SA as license – so not really open, but still the right direction.

Pecha Kucha

Fabian Flöck and me then moderated a Pecha Kucha session with the contributors of the 32 posters. Each talk had exactly 2 minutes time, which we demanded precisely. 😉

The session was really dense and fast, and I think it delivered the main goal quite good: it connected researchers based on their work and interests to being able to get in touch afterwards.

Frank Schweitzer: Modelling emotional agents – Data, Interaction, Simulation

The closing talk, and for me the most interesting one, was held by Frank Schweitzer, a Full Professor for Systems Design at ETH Zurich. He summed up the day and made some clear statements about challenges in the field of Computational Social Science, as the picture below shows.

https://twitter.com/ph_singer/status/539468291522437120/
Copyright: Philipp Singer

I’m thinking a lot about the scientific and socio-cultural dynamics between the Social and Computer Science communities – the problems, the challenges and mostly the many opportunities I see. But more on this sometimes later in an own post.

Resumee

The day was a good balance between heavy input, social interaction and having a nice time. Sounds normal, but often events lack in one of these areas. Too much information can make knowledge processing and memorizing hard and social interactions unease. The positive atmosphere and the good organization made this possible and lead to new connections between researchers from different communities.

For next year I would like to see more discussions about what Computational Social Science is and the different approaches in the intersecting disciplines – getting a little bit out of the comfort zone and creating some constructive friction. A practical part with workshops around questions like “which programming language should I use”, “how to present, share and publish my work best” or “can you show me how to version my code, data and documentation” would be very valuable, especially for early career researchers. Also more discussions about methodological aspects, especially on the link between data design and evaluation would be interesting.

So it is easy to say, the best improvement would be to extend the format and make it 2015 several days. See you hopefully there.

References

[zotpressInTextBib style=”apa” sort=”ASC” download=”yes”]

It will be about knowledge production on wikipedia

So, after some talks and research, the topic of my bachelor thesis has been narrowed down a little bit. Together with Fabian Flöck I will look into the mechanisms of collaborative knowledge production on the web, specifically on Wikipedia. Collaborative online plattforms with masses of contributors offer many new phenomena to look at, which has never before been accessible for researchers. Questions like: Who defines what is right? By which criteria will a contribution be accepted or reverted? How does the organization of knowledge and their contributors evolve over time? Are there people, who “own” knowledge? Why do people agree or disagree? How do disputes evolve? Which criteria leads to better quality of the article?

How knowledge appears, evolves and gets communicated are questions coming deeply from my philosophical hassle about what truth is. Especially the last year, as part of my philosophical courses I took at the IFZ in Graz (Einführung in die Technikphilosophie, Technik – Ethik – Politik), these question reflamed and matured through engaging with thoughts from Heidegger, Latour and Nietzsche.

The specific questioning is of course still open, but it will be something around how do editors in Wikipedia work more or less good together to write encyclopedia articles. Looking for specific patterns (they are everywhere¹) on the evolution of an article and the contributions of editors behind it.

Lots of questions now in my head. So next I will try to dive a little bit into Wikpedia and existing literature about it to get an understanding of actual questions and the state of research.

Sources

Thumb
Title: Martin_Heidegger_for_WP
Author: Herbert Wetterauer
Source: https://commons.wikimedia.org/wiki/File:Martin_Heidegger_for_WP.jpg
License: Creative Commons Attribution-Share Alike 3.0 Unported

My residence at the GESIS institute in Cologne

The next four months I will live in Cologne and write my bachelor thesis at GESIS. Why that and what is this? Here are the answers.

GESIS Köln

GESIS Köln by Stefan Kasberger (CC BY 3.0 AT)

So, here I’m, living since two weeks in Cologne. Whenever I told friends, that I will go for my bachelor thesis to another country, most people asked me “Why that?”, cause it’s not that usual. So, here my answers:

1. My advisor is Markus Strohmaier (@mstrohm), the scientific director of the Computational Social Science department and lead of the Data Science team here. He is a well reputated and active researcher in the field of data science with focus on social questions and an interdiscplinary approach. Our connection goes back to some talks about Data Science, especially Network Science back in 2012, and his drive to solve problems and focus on research impressed me from the beginning – unfortunately that is something rare in Austria. So, when he surprisingly asked me if I want to come to Cologne for my bachelor thesis, the answer was quite easy.

2. The GESIS – Leibniz-Institute for the Social Sciences is the largest infrastructure institution for the Social Sciences in Germany. It is based in the heart of Cologne, not far away from the Kölner Dom and the main train station, something very pleasant. I have joined there the Data Science team, where problems at the intersection of Computer Science and Sociology are being adressed. I’m looking forward to learn a lot from the many professionals here and hopefully it gives me the opportunity to get deeper insights into how research is done and organized in 2014.

3. Cologne always had a very good reputation: liberal, open and very hospitable. So again, I took the opportunity to live in a city outside Austria, and my amazing experiences with city life and german culture the last semester in Berlin made the decision quite easy.

So, seems like all of the more abstract requirements stated in my first post are fullfilled. So let the games begin!

To be or not to be a scientist

The next 4 months will be fully dedicated to my first scientific work – writing a bachelor thesis in environmental systems science with focus on geography.

This is a very important step for me, here is why. When I thought about my bachelor thesis, it was always about “which method(s) I want to use”. This got more and more precise in the last two years, through learning machinge learning and data science methods mostly by myself and investing a big amount of spare-time in this. So I was pretty sure it will be something around machine learning and/or network science, maybe also about understanding natural language through computers – cause it makes fun and offers a sense of exploration and innovation to me.

The topic itself is secondary, but still really important. It should deepen my knowledge in some of the areas I’m interested in, like geo-politics, urbanism, poverty, openness, knowledge creation, resources or migration. It should be nothing less than relevant, potentially emancipatory and contribute to a more just society.

And of course, to be able to make it open is an important point too. Plans are to share everything all along the way regularly. I will blog here frequently about my struggles, experiences and improvements and try to get a better understanding of how to open science. And as always, an own GitHub repository will be created of course.

Besides my scientific interests, the whole activity has an even more important point for my life as a whole. With my intense dedication to a bachelor thesis (4 months full time with the goal of a publication) I want to get a hands-on experience how the life of a researcher is nowadays and create something I can build upon in the future. At the end everything surrounds around the question: Do I want to live the life of an researcher in 2014? Until now, what I will do after finishing my study is still an open debate: research, working or changing my field of practice again totally.

And hopefully also my english will improve too. 😉