Reaction
REACTION (Retrieval, Extraction and Aggregation Computing Technology for Integrating and Organizing News) is an initiative for developing a computational journalism platform (mostly) for Portuguese.
Announcements
- 2013-01-31 We made our REACTION workshops page public.
Resources
A list of resources developed and maintained by the project can be found here
About
News are no longer simply produced and consumed, but instead continually evolve over time as a cooperative dialog between news outlets and the public at-large. News presentation must fundamentally reflect this, providing anytime organization of the latest events, conveying how story elements developed over time, and integrating the story in the larger world context. In short, the days of simple online aggregation are over; the world has already moved on. While the idea of journalists using computers as information discovery tools goes back several decades, never before has computation been understood to be so tightly integrated with the core of journalistic practice. Journalistic excellence today requires advanced data mining and search technologies, together with novel web services and integrative mashups.
We identify the following important challenges facing the field:
- Automatic analysis of content, including news, blogs, micro-blogs, comments: detect and resolve references to named entities (e.g., public figures); tracking these entities and events involving them; assess quality (e.g., readability); infer polarity (e.g., sentiment); detect cases and patterns of re-use (e.g., via “memes” or larger units of similar text) and information flow.
- Automatic analysis of explicit and implicit social networks: infer implicit social networks based on information flow patterns involving content producers and consumers; discover communities; infer authority and credibility of sources; find experts; identify influential community members.
- Design of rich visualization and interaction interfaces for presenting dynamic, personalized news and learning about implicit relationships between news stories and reader communities.
- Case-study evaluation of developed computational journalism methodology in a production setting, to provide a critical analysis of practical impact on newsroom quality, efficiency, and economics (cost and revenue).
Activities
We research new tools for providing greater automation in news gathering, analysis, and delivery, while respecting practical constraints of news producers and consumers. We emphasize decomposition of stories into finer-grained elements and discovery of implicit relations between them. We also emphasize the relationship between news and social networks, both explicit and implicit, which underlie the news and significantly shape its content, quality, and authority. Hands-on experience in the newsroom will enable practitioners to innovate current practice of news production and identify important avenues for future research in computational journalism.
REACTION is organized in seven complementary research tasks which jointly address the four problem areas identified above:
- Mining Resources (lead by: Paula Carvalho, INESC-ID)
- Entity and Event Tracking (lead by: Bruno Martins, INESC-ID)
- Web Community Sensing (lead by: Carlos Soares, FEUP)
- Tracking Information Flow (lead by: Francisco Couto, LASIGE)
- Interaction and Personalization (lead by: Mário J. Silva, UTA)
- Query and Visualization (lead by: Carlos Soares, FEUP)
- Computational Newsroom (lead by: António Granado, CIMJ)
Funding
Reaction is a Strategic Research and development project in Interactive and Advanced Digital Media, funded by the CoLab, UT Austin | Portugal International Collaboraboratory for Emerging Technologies
- Proj #: UTA-Est/MAI/0006/2009
- Period: 1-October-10 to 30-September-13
- FCT: €220.000,00 (Portuguese Universities)
- FCT to UTA
- support from Público and PT Comunicações (Sapo)
- FCT Scholarships SFRH/BD/70478/2010 (D. Batista) and SFRH/BPD/45416/2008 (P. Carvalho)
Research Team
INESC-ID Team:
- Mário J. Silva (principal investigator)
- Francisco Couto (from LASIGE/FCUL)
- Paula Carvalho
- Bruno Martins
- David Batista
- Ivo Anastácio
- Silvio Moreira
FEUP/LIACC Team:
- Carlos Soares
- Eduarda Mendes Rodrigues
- Eugénio Oliveira
- Arian Pasquali (Sapo Labs)
- Gustavo Laboreiro
- Jorge Teixeira (Sapo Labs)
UT Austin Team:
UNL/CIMJ Team
Media Industry (PT Comunicações):
Media Industry (Público):
Former members:
- Luís Sarmento
- Matko Bosnjak
- Hohyon Ryu
- Diogo Figueiredo
- João Ramalho
- Nuno Baldaia
- Andrija Cajic
Presentations
- Mário J. Silva, REACTION. Gulbenkian Foundation, PT. CoLab annual conference. September, 2010.:
- Luís Sarmento, APIs Semânticas SAPO Labs (Slides). Presentation at Codebits V. November, 2011.
See also our REACTION Workshops page with technical presentations and progress tracking meetings.
Buzz
Follow us on Scoop.it! for more recent news.
- "Socrates comeback", March 2013:
- 2013-03-28 edition: ´Gostar´, ´culpar´ e ´mentira´ foram as palavras mais repetidas nos tweets com menções a Sócrates
- 2013-03-31 at the Sapo Portal: MVDI
- Publico series on PhDs in Portugal:
- 2013-03-08 edition: A evolução dos doutoramentos
- Publico series on marriages in Portugal:
- 2013-02-17 edition: O destino já não passa pelo casamento e não se rompe com o divórcio
- 2013-02-13 edition: O que moldou as famílias portuguesas desde 1864
- Publico on twitteuro:
- 2012-07-02 edition: ‘Ganhar’ foi a palavra mais repetida pelos portugueses no Twitter
- 2012-06-28 edition: Trezentos mil tweets durante o Portugal-Espanha
- 2012-06-21 edition: Jogo com a República Checa marca recorde para Portugal no Twitter
- 2012-06-17 edition: Cristiano referido mais de 3 mil vezes no Twitter durante o jogo
- 2012-06-13 edition: Twitteuro mede popularidade dos jogadores e equipas do Euro
- Sapo Notícias, 2011-11-10 edition: Nova Ferramenta Mostra Rede De Ligações entre Personalidades nas Notícias
- Canal UP, 2011-05-23 Twitómetro: projecto académico quer evoluir até à previsão de resultados eleitorais
- RTP 1, 2011-05-20 edition: Especial Informação - Twitómetro part 1 part 2 part 3
- Ionline, 2005-05-18 edition: Sócrates é o campeão da twittosfera, para o bem e para o mal
- SIC Notícias, 2005-05-27 edition: Sócrates e Passos Coelho lideram twitómetro
- Publico on twitómetro:
- 2011-05-20 edition: Twitómetro mede opinião sobre líderes políticos no Twitter
- 2011-05-21 edition: Twitómetro II Sócrates e Passos com mesma percentagem de menções negativas
- 2011-05-22 edition: Twitómetro III Sócrates é o mais mencionado, mas Portas e Louçã com mais comentários positivos
- 2011-05-23 edition: Twitómetro IV Sócrates e Passos reúnem maioria de comentários negativos
- 2011-05-24 edition: Twitómetro V Sócrates é o candidato mais citado e com mais comentários negativos
- 2011-05-25 edition: Twitómetro VI Passos Coelho é o candidato com menos comentários positivos
- 2011-05-26 edition: Twitómetro VII Jerónimo de Sousa foi o candidato com mais comentários negativos
- 2011-05-27 edition: Twitómetro VIII Comentários sobre Passos Coelho disparam em dia marcado pelo aborto
- 2011-05-28 edition: Twitómetro IX Jerónimo volta a ser o candidato com mais comentários negativos
- 2011-05-29 edition: Twitómetro X Sócrates e Louçã foram os líderes com mais comentários positivos
- 2011-05-30 edition: Twitómetro XI Louçã e Portas com maioria de comentários positivos
- 2011-05-31 edition: Twitómetro XII Jerónimo e Portas com maioria de comentários positivos
- 2011-06-01 edition: Twitómetro XIII Nenhum líder político obteve ontem maioria de comentários positivos
- 2011-06-02 edition: Twitómetro XIV Sócrates reúne quase 60 por cento dos comentários
- 2011-06-03 edition: Twitómetro XV Comentários negativos sobre Paulo Portas disparam
- Notícias SAPO: Especial Eleições Legislativas 2011:
- 2011-05-22 edition: Sócrates e Passos Coelho empatados entre quem diz bem
- 2011-05-23 edition: Sócrates com twitosfera mais favorável no arranque da campanha
- 2011-05-24 edition: Passos Coelho melhor que Sócrates e Portas
- 2011-05-25 edition: Um bom dia para Paulo Portas
- 2011-05-26 edition: Falou-se mais de Portas, falou-se melhor de Passos
- 2011-05-27 edition: Passos lidera tweets
- 2011-05-29 edition: Um bom domingo para Portas e Louçã
- 2011-06-03 edition: Temperaturas diferentes no fecho da campanha
- 2011-06-03 edition: Os tweets da recta final
- Notícias SAPO: Rúbrica O Mundo Visto Daqui:
- 2011-11-10 edition: Nova ferramenta mostra rede de ligações entre personalidades nas notícias
- 2011-11-15 edition: Paulo Bento e Cristiano Ronaldo em foco nas notícias
- All editions: Search SAPO Notícias for "o mundo visto daqui"
- Interactive version: "MVDi: Mundo Visto Daqui interactivo"
- Público on "Computational Journalism"
- 2012-09-09 edition: Passos Coelho pôs as redes sociais a ferver
- 2012-11-03 edition: Vergonha foi a palavra mais escrita no Facebook de Passos Coelho
Publications
G. Laboreiro, M. Bošnjak, E. Mendes Rodrigues, L. Sarmento and E. Oliveira. Determining language variant in microblog messages. To appear in: Proc. The 28th ACM Symposium On Applied Computing, SAC 2013, Information Access and Retrieval Track (IAR).
Duarte Dias, Ivo Anastácio, Bruno Martins (2012) Geocoding Textual Documents Through Hierarchical Classifiers Based on Language Models. Linguamática, Revista para o Processamento Automático das Línguas Ibéricas, 4(2)
Bosnjak, M., Sarmento, L., and Mendes Rodrigues, E. Robust Language Identification with RapidMiner - A Text Mining Use Case. To appear in: Hofmann, M. and Klinkenberg, R. (Eds.), Use Cases with RapidMiner.
Silvio Moreira, David S. Batista, Paula Carvalho, Francisco M. Couto, and Mário J. Silva. Tracking Politics with POWER. Program: electronic library and information systems. ISSN: 0033-0337 (forthcoming).
Hohyon Ryu, Matthew Lease, and Nicholas Woodward. Finding and Exploring Memes in Social Media. In Proceedings of the 23rd ACM Conference on Hypertext and Social Media. ACM, June 2012
Document
M. Bosnjak, E. Oliveira, J. Martins, L. Sarmento and E. Mendes Rodrigues. TwitterEcho - A Distributed Focused Crawler to Support Open Research with Twitter Data. In Proc. of SMANE 2012: Intl. Workshop on Social Media Applications in News and Entertainment, co-located with the ACM 2012 International World Wide Web Conference, WWW 2012, April 2012, Lyon, France.
David S. Batista, João D. Ferreira, Francisco M Couto, and Mário J. Silva. Toponym Disambiguation using Ontology-based Semantic Similarity. In Lecture Notes in Computer Science (LNCS) / Lecture Notes in Artificial Intelligence (LNAI), International Conference on Computational Processing of Portuguese (PROPOR), 17-20 April, 2012, Coimbra, Portugal.
Mário J. Silva, Paula Carvalho, Luís Sarmento. Building a Sentiment Lexicon for Social Judgement Mining. In Lecture Notes in Computer Science (LNCS) / Lecture Notes in Artificial Intelligence (LNAI), International Conference on Computational Processing of Portuguese (PROPOR), 17-20 April, 2012, Coimbra, Portugal.
G. Laboreiro, L. Sarmento and E. Oliveira. Identifying automatic posting systems in microblogs. The 4th Track on Text Mining and Applications (TeMA 2011) in the 15th Portuguese Conference of Artificial Intelligence (EPIA), October 2011, Lisbon, Portugal.
J. Teixeira, L. Sarmento and E. Oliveira. A bootstrapping approach for training a NER with Conditional Random Fields. The 4th Track on Text Mining and Applications (TeMA 2011) in the 15th Portuguese Conference of Artificial ntelligence (EPIA), October 2011, Lisbon, Portugal
Sousa-Silva, R.; Laboreiro, G.; Sarmento, L.; Grant, T.; Oliveira, E. and Maia, B. ‘twazn me!!! ;(’ Automatic Authorship Analysis of Micro-Blogging Messages. Procedings of the 16th International Conference on Applications of Natural Language to Information Systems (NLDB 2011), July 2011, Alicante, Spain.
Paula Carvalho, Luís Sarmento, Mário J. Silva, Jorge Teixeira, Liars and Saviors in a Sentiment Annotated Corpus of Comments to Political Debates.9th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL-HTL) Portland, Oregon, USA, June, 2011.
Document
J. Teixeira, L. Sarmento and E. Oliveira. Semi-Automatic Creation of a Reference News Corpus for Fine-Grained Multi-Label Scenarios. Third Workshop on Intelligent Systems and Applications in 6ª Conferência Ibérica de Sistemas e Tecnologias de Informação (CISTI), June 2011, Chaves, Portugal
Mário J. Silva, REACTION TEAM, Notas sobre a Realização e Qualidade do Twitómetro Technical Report. Technical Report . University of Lisbon, Faculty of Sciences,LASIGE, May 2011.
Document
Silvio Moreira, David Batista, Paula Carvalho, Francisco Couto, Mário J. Silva, POWER - Politics Ontology for Web Entity Retrieval. ONTOSE 2011: 5th International Workshop on Ontology, Models, Conceptualization and Epistemology in Social, Artificial and Natural Systems. Lecture Notes in Business Information Processing, 2011, Volume 83, Part 8, 489-500, DOI: 10.1007/978-3-642-22056-2_51.
Document.
Mário J. Silva, Paula Carvalho, Carlos Costa, Luís Sarmento, Automatic Expansion of a Social Judgment Lexicon for Sentiment Analysis Technical Report. TR 10-08. University of Lisbon, Faculty of Sciences, LASIGE, December 2010. doi: 10455/6694
