This paper presents an empirical analysis of web news content during the 2008 USA Presidential Campaign. The analysis is based on a representative sample of online news sources, monitored on a daily basis, during and immediately after the campaign. Data was gathered from 38 websites of the highest-rated USA news sources across print and broadcast media, and several popular electronic media news sources. The sample mostly drew from the online affiliates and content sources of the highestcirculated newspapers and magazines in the USA and the top-rated (Nielsen 6/08) USA-based network and cable television stations. The web-based content of these media sources were assumed to mirror the content and tone of their corresponding traditional print and broadcast media. Keywords were identified from Obama's and McCain's nomination acceptance speeches and the three presidential debates. Keywords were determined from the most frequent content-bearing words found in the transcripts of these events. One goal of the analysis was to understand how different media sources presented each of the candidates and their key issues. A second goal was to analyze and visualize the network that was created by keywords within news sources' web pages. Network analysis tools were used to explore media behavior and trends during the campaign. The network's nodes represent keywords and candidates, while the frequency of co-occurrence of nodes is graphically represented by relative proximity. A set of centrality measures was applied to the network of keywords. Further research involves forecasting modeling, trend-setter analysis, applying a weighted taxonomy to keywords, linking keywords, and calculating distances between keywords within web pages.
Media Behavior During 2008 US Presidential Election: A Web Content Analysis
GRIPPA, FRANCESCA;
2009-01-01
Abstract
This paper presents an empirical analysis of web news content during the 2008 USA Presidential Campaign. The analysis is based on a representative sample of online news sources, monitored on a daily basis, during and immediately after the campaign. Data was gathered from 38 websites of the highest-rated USA news sources across print and broadcast media, and several popular electronic media news sources. The sample mostly drew from the online affiliates and content sources of the highestcirculated newspapers and magazines in the USA and the top-rated (Nielsen 6/08) USA-based network and cable television stations. The web-based content of these media sources were assumed to mirror the content and tone of their corresponding traditional print and broadcast media. Keywords were identified from Obama's and McCain's nomination acceptance speeches and the three presidential debates. Keywords were determined from the most frequent content-bearing words found in the transcripts of these events. One goal of the analysis was to understand how different media sources presented each of the candidates and their key issues. A second goal was to analyze and visualize the network that was created by keywords within news sources' web pages. Network analysis tools were used to explore media behavior and trends during the campaign. The network's nodes represent keywords and candidates, while the frequency of co-occurrence of nodes is graphically represented by relative proximity. A set of centrality measures was applied to the network of keywords. Further research involves forecasting modeling, trend-setter analysis, applying a weighted taxonomy to keywords, linking keywords, and calculating distances between keywords within web pages.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.