Description of the Data

========CONFERENCES==========

The conferences chosen are:
- All conferences or symposia (6) that we found in CS or Engineering Education for which the information (table of contents) of at least 10 editions is available.
- ICER: a mixed (education-research) conference.
- Six CS research conferences. Criteria: all organized by the ACM (processing is easier when the information comes in a single format); similar in size to the CS-Eng education conferences; similar in age to the CS-Eng education conferences.

We have limited ourselves to 16 editions. If a conference had more than 16 editions, we chose the 16 most recent (as of summer 2011).

The conferences and editions studied are:

####EDUCATION
ACE: Australasian Computing Education Conference. 12 editions: 1996-98; 2000; 2003-2010
FIE: Frontiers in Education. 16 editions: 1996-2010
ITiCSE: Integrating Technology in CS Education. 16 editions: 1996-2011
Jenui: Jornadas de Enseñanza Universitaria de la Informática (Spain). 15 editions: 1997-2011
SEE: Conference on Software Engineering Education and Training. 16 editions: 1996-2011
SIGCSE: Technical Symposium on Computer Science Education. 16 editions: 1996-2011

####MIXED
ICER: International Conference on Computer Education Research. 7 editions: 2005-2011

####RESEARCH
ASE: International Conference on Automated Software Engineering. 14 editions: 1997-2010
ASPLOS: Architectural Support for Programming Languages and Operating Systems. 15 editions: 1987, 89, 91, 92, 94, 96, 98, 2000, 02, 04, 06, 08-11
ICSE: International Conference on Software Engineering. 16 editions: 1996-2011
ISCA: International Symposium on Computer Architecture. 16 editions: 1996-2011
SIGMETRICS: International Conference on Measurement and Modelling in Computer Systems. 16 editions: 1996-2011
SPAA: Symposium on Parallelism in Algorithms and Architectures. 16 editions: 1996-2011

Of the documents in the proceedings we have chosen articles and posters. We have discarded panels and other special events.
We have not used these because they are not reported in a standard way (sometimes only the panel moderator appears, at other times all the panelists) and, most importantly, because they do not represent collaboration in the same sense as coauthoring a paper. For instance, panelists might not know some of their colleagues before the panel or meet them again afterwards (that has been my experience).

=========File description=========

There is a folder per conference with the following files. All textual data is in UTF-8.
- "Code.bib": This BibTeX file contains the bibliographic information, with a unique bibkey identifying each paper and the authors coded as four-digit numbers with a prefix that depends on the conference (ItA for ITiCSE, Aut for Jenui, ICA for ICER, SEA for ICSE, SPA for SPAA, ASA for ASE...).
- "A.bib": This file contains the bibliographic information with the authors' names. Some authors sign their papers under several names; in this file we have chosen the most descriptive one, so there is only one name per author. Used for convenience, not for computation.
- "au.csv": This file contains the number of papers published by each author in the conference per year, plus the total. It has a header line and is separated by tabs. The authors are ordered by code number. The code ID of the author is not part of the data.
- "grph.csv": This file contains the graph information. Each line represents an arc and contains the two authors that are connected, the bibkey of the paper that connects them, and the year of publication.
- "AuCod.csv": This file contains the author code followed by the chosen unique name for each author. Separated by tabs, no header.
- "pap.csv": This file contains the paper information: the bibkey of the paper, the number of coauthors of the paper, and the year of publication. Separated by tabs, no header.
- "tit.csv": This file contains the bibkey and the title of each paper. Not used in the computation; added only for convenience. Separated by tabs, no header.
- "script.R": The R script to compute some data
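As an illustration of how the arc file can be consumed outside R, the following Python sketch reads a "grph.csv"-like file and counts the distinct coauthors of each author. It is not the authors' script. It assumes that the fields are separated by tabs and appear in the order given in the description (author, author, bibkey, year); the separator of "grph.csv" is not stated above, so adjust `delimiter` if needed. The function names and the sample author codes and bibkeys are hypothetical.

```python
import csv
import io
from collections import defaultdict


def read_arcs(handle, delimiter="\t"):
    """Yield (author_a, author_b, bibkey, year) tuples from an arc file.

    Assumption: four fields per line in the order author, author, bibkey, year.
    Lines with fewer fields or a non-numeric year (e.g. a header) are skipped.
    """
    for row in csv.reader(handle, delimiter=delimiter):
        if len(row) < 4:
            continue
        author_a, author_b, bibkey, year = (field.strip() for field in row[:4])
        if not year.isdigit():
            continue
        yield author_a, author_b, bibkey, int(year)


def coauthor_graph(arcs):
    """Build an undirected adjacency map: author code -> set of coauthor codes."""
    graph = defaultdict(set)
    for author_a, author_b, _bibkey, _year in arcs:
        graph[author_a].add(author_b)
        graph[author_b].add(author_a)
    return graph


# Made-up sample in the assumed layout: one three-author paper, one two-author paper.
SAMPLE = (
    "ItA0001\tItA0002\tkey1\t2005\n"
    "ItA0001\tItA0003\tkey1\t2005\n"
    "ItA0002\tItA0003\tkey1\t2005\n"
    "ItA0001\tItA0004\tkey2\t2007\n"
)

arcs = list(read_arcs(io.StringIO(SAMPLE)))
graph = coauthor_graph(arcs)

# Number of distinct coauthors per author.
degree = {author: len(neighbours) for author, neighbours in graph.items()}
```

To run it on real data, replace `io.StringIO(SAMPLE)` with `open(path, encoding="utf-8", newline="")`, where `path` points to the "grph.csv" file of one conference folder. Using sets means that two authors who share several papers are counted as one coauthor pair.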