downloading folders from google drive.

I wanted to download some course material on RL shared by the author via Google drive using the command line.  I got a bunch of stuff using wget a folder in google drive was a challenge. I looked it up in SO which gave me a hint but no solution. I installed gdown using pip and then used: gdown --folder --continue if there are more than 50 files you need to use --remaining-ok and only get the first 50. In such a case its best to download using the folder using the UI and decompress locally. Decompressing from the command line created errors related to unicode but using the mac UI I decompressed without a glitch.

Wikisym 2012

Due to a kind grant by the WMF foundation I was able to attend Wikisym 2012 in Linz Austria what follows is my report on the event.


English: The renovated Ars Electronica Center ...
English: The renovated Ars Electronica Center at Linz, seen from the bridge across the Danube at night (Photo credit: Wikipedia)
I am a Wikipedian based in Budapest Hungary. I have been active for the last year with WM.HU and participated in a number of the local  chapter's events ever since being introduced to them at Wikimania 2012 in Haifa. During such a meeting I, Bence Damkos, and other chapter luminaries got to discussing many apparent cultural paradoxes taking place in the virtual community. Since I was studying the theory of games at the time I began to notice that some of the situations were very similar to a classic game such as the prisoner's dilemma and the battle of the sexes while others resembled second-price sealed actions and bargaining games. I was intrigued and I started publishing some analysis on a page on Meta. At this time I came across some interesting ideas from another researcher, Jodi Schneider who introduced me to the field of CSCW (Computer Supported Collaborative Work) and to her area of research - the deletion process. Eventually, she suggested that I should attend a wikisym. However, I had no background in writing a conference paper I asked her for help and she copy-edited my work guiding me through a number of tricky issues.

I eventually submitted the paper and to my surprise, it was accepted. So I took a train to Linz - I was surprised when after boarding the train I had to reserve a seat and accordingly had to stand for the duration of the five-hour journey. By the time I arrived at the little town, it was late and I was exhausted. I took a bus and ended up in a hotel by the Danube.

At the Conference

On the morning of the conference, I took breakfast and met some of my favorite people - Maryna Pinchuk and Ryan Faulkner who was preparing to give a paper on their work in running an editor engagement experiment - in which I had unwittingly participated. After a short chat, I made my way to the venue the Ars Electronica and I could not believe my eyes - the conference was hosted by one of the most amazing technology museums in Europe. In the evening, the building would completely dominate the riverside view with its digital animation installation.

The Conference began with a number of presentations. I was impressed by most of the presentation
CPoV Wikipedia Conference
 (Photo credit: Anne Helmond)
 but my sentiments were clearly not shared by everyone at the conference. I later learned that some of the more vehement voices were doctoral and students who were out to prove their mettle.

The papers that most struck my fancy used a number of novel techniques. Ranging from actuarial survival analysis through SNA to sentiment analysis.
Classifying Wikipedia Articles Using Network Motif Counts and Ratios by Guangyu Wu, Martin Harrigan, and Pádraig Cunningham was one of the hardest to understand. It used a novel SNA technique to classify Wikipedia articles. However, it seemed that the other participant did not like the level of detail that the researchers had provided. Dr Bernie Hoagan a Research Fellow from the Oxford Internet Institute asked the researchers why they had not tried to use ERGMs which might give more accurate results. I would later correspond with Dr. Hoagan and he helped me get started with Social network analysis.

A paper by  Michaela Ferron and Paolo Massa titled Psychological processes underlying Wikipedia representations of natural and man-made disasters. It showcased the use of sentiment analysis. I was aware of this method from my work in a Natural Language Programming Outfit in Israel for which I wrote a search engine for the Hebrew Wikipedia. But I had considered this technique very complex to set up. On reviewing the paper I realized that an off-the-shelf tool called LIWC (Linguistic IInquiryand Word Count) can do the job. LIWC was developed by a team led by James W. Pennebaker whose book "The Secret Life of Pronouns" is a gentle introduction to the intricacies of sentiment analysis. What remained difficult to grasp was a three-dimensional model of sentiment. I was unfamiliar with the terminology so I would end up rereading this paper a couple of times. But this was not the only paper to use sentiment analysis or natural language technology. Manypedia: Comparing Language Points of View of Wikipedia Communities by again by Paolo Massa, and Federico Scrinzi showed a tool that allows users to compare different language edition versions of the same article in their own language using machine translation. A second paper to discuss sentiment analysis, this time focusing on talk pages was: Emotions and dialogue in a peer-production community: the case of Wikipedia. This paper used an even more complex paradigm than the previous one. It utilized Margaret M. Bradley & Peter J. Lang's ANEW (Affective Norms for English Words) word list to create a three-dimensional model of sentiment (valence, arousal, and dominance). Even more interesting were its conclusions regarding the participation of women and its implication on Wikipedia's growing gender gap.

Heather Ford, Jimmy Wales, and Florence Devoua...
Heather Ford, Jimmy Wales, (Photo credit: Wikipedia)
I would discuss some of my ideas with some of the participants over dinner. One amusing debate included Stuart Geiger and when I quoted a point from an excellent paper he pointed out that he had written it. I also met with heather ford who co-authored a paper with Mr. Geiger. Heather Ford told us about her blog Ethnography matters which I started to follow because it turns out that ethnography really matters These include work by Stuart Geiger and on the lives of robots using trace ethnography.

During the conference, I met with Jodi Schneider but we had little opportunity to chat due to an upcoming deadline. I enjoy following her research on deletion as well as on Argumentation in collaborative deliberations. I decided to help Wikipedia's research newsletter by abstracting and providing laymen's summaries of CSCW-related research.

Panels, Demos, and Posters

Phoebe Ayers at WikiSym 2010
Phoebe Ayers at WikiSym 2010 (Photo credit: Wikipedia)
I found out that the Wikisym conference had a colorful history and participated in a discussion mediated by the delectable Phoebe Ayers on the conference's future. I suggested that the conference should be collocated with Wikimania since this would help reduce the cost of community members who attend the Wikimania conference.  A second conundrum being debated was the issue of an open academy. This was an issue of growing urgency since the WMF, one of Wikisym's chief sponsors prefers to support open access open research work. I think that Phoebe Ayers is a wonderful person and was sad to hear she was no longer on the foundation's board of directors.

Another serendipitous facet of the Wikisym conference is the demo and poster session which allow hackers to present their latest breakthroughs and innovations in the technology, of Wikis. This had once been the cornerstone of the conference.  I met the developer of TikiWiki as well as a Java-based XWiki.

Jimmy Wales' Keynote Address

Wikisym's keynote was given by Wikipedia's co-founder Jimmy Wales. He explained how this talk was one of the trickiest he would give this year. However, this was a much better talk than he gave at Wikimania. He mentioned research possibilities and he responded to my question. I was and still am considering if population dynamics could affect phase changes within the community. My question was if a Wiki's community dropped below a certain size if it would no longer be viable to maintain it. One example of a Wiki being shut down was the 9-11 wiki. I found Wales's answer enlightening - he said that big or small the community should have little problem adapting to take care of its wiki. Another point worth mentioning was his recommendation to use Wiki data sets of smaller wikis in research. He recommended Muppet wiki as an example of a wiki with a significantly different governance structure than Wikipedia.

After the conference 

Following the conference, I kept in touch with a number of the participants. I applied myself to study social network analysis as well as data analysis with R. I increased my participation in the research newsletter. I hope to expand my research further using population dynamics on graphs and evolutionary game theory. However, with all the new research methods, I've gleaned. I am uncertain what direction my future investigations will take only that they will be even more exciting than before.


Popular posts from this blog

Moodle <=< Mediawiki SUL integration - first thoughts

downloading folders from google drive.

AWS CloudFormation Pros and Cons