Satyamev Jayate, one of India’s highest-rated television shows, is using data as a means to effect meaningful change. The show’s producers are aggregating and analyzing the millions of messages they receive on controversial issues to do everything from planning future episodes to pushing for political change.
In order keep up with all the messages, Satyamev Jayate turned to Persistent Systems, an Indian IT consultancy with offices around the world, which created a system for automating their analysis. Here’s how the process works.
About a day-and-a-half before each show, Satyamev Jayate’s production company tells Persistent what the issue will be and the two groups come up with a taxonomy that will help the system sort through messages based on what topics will be brought up during Sunday’s show. But it’s not by any means the definitive list. As activity ramps up on Twitter while the show airs (tweet rates are highest during commercials and immediately after it ends, by the way), the team gets a sense of what topics are resonating with viewers and what themes they can expect in the nearly million responses that will follow.
When the responses actually do start pouring in after lunch, they hit a system designed by Persistent to automatically tag them and score them based on interest level and sentiment. So, as Mukund Deshpande, head of business intelligence and analytic at Persistent, told a long message with an interesting story will be marked as higher quality, while a short, congratulatory note will be scored lower. Because so many viewers write in “Hinglish,” a combination of Hindi and English, an off-the-shelf system wouldn't have been as accurate for processing these messages.
In the future, he’d like to train the system to recognize various gradients of emotion, too, beyond just simple sentiment. That means not just “positive” or “negative,” but also “happy,” “sad,” “angry” and any other way a viewer might be feeling. The best messages are then sent to a team of trained analysts — often college students and graduates, along with some Persistent employees — who decide which ones are worth following up on for a Friday radio show Khan does, and for placement on Satyamev Jayate’s web site. These analysts try to ensure that the stories shared are truthful and that the messages don’t contain personal information that could get viewers in trouble or affect their privacy. Data visualizations about how many people have responded and where they come from is available on the Impact section of the show’s site, as well as on separate Impact pages for each episode.
to amend the court system accordingly, the producer told me.
Sometimes, though, the results simply present an interesting — if not troubling — view into the Indian subconscious. Almost 32 percent of respondents, for example, voted in favor of the right of families to use force preventing the marriage of two willing adults (subsequent analysis uncovered some reasons why, including continuing opposition to inter-caste marriage), while almost 14 percent of respondents one week said that beating a woman is a sign of masculinity. And although women comprise only about 32 percent of the show’s audience, they have accounted for the majority of responses on shows addressing issues important to them.
The producer said his team also uses the data to inspire ideas for future shows and to populate a weekly radio show that Khan does with a local journalist. The Satyamev Jayate team analyzes the week’s messages in order to pick the most powerful and determine trends in viewers’ feelings, and Khan shares them during the interview. The second season, he said, will be shaped in part by how viewers responded to the format during the first season and the issues they want covered next.
Beyond just the next season, though — and the occasional political victory — the hope is that all the data Satyamev Jayate generates will have continuing utility. Deshpande said he’d like to see it used for ethnographic and social science research, because the data set is larger than most academic studies could generate (something that’s already happening with crowd sourced medical research
) and it’s very high quality because of the demographic and geographic information attached to it.
However, the producer seems perfectly content right now with the way Satyamev Jayate is resonating with the public. For example, he said, viewers are reporting crimes they previously might not have considered too big a deal and are reaching out to disabled citizens. This is the first time many people are speaking openly about these issues, he said, and they’re able to track the effects because they’re able to ensure no message is left behind.
In order keep up with all the messages, Satyamev Jayate turned to Persistent Systems, an Indian IT consultancy with offices around the world, which created a system for automating their analysis. Here’s how the process works.
About a day-and-a-half before each show, Satyamev Jayate’s production company tells Persistent what the issue will be and the two groups come up with a taxonomy that will help the system sort through messages based on what topics will be brought up during Sunday’s show. But it’s not by any means the definitive list. As activity ramps up on Twitter while the show airs (tweet rates are highest during commercials and immediately after it ends, by the way), the team gets a sense of what topics are resonating with viewers and what themes they can expect in the nearly million responses that will follow.
When the responses actually do start pouring in after lunch, they hit a system designed by Persistent to automatically tag them and score them based on interest level and sentiment. So, as Mukund Deshpande, head of business intelligence and analytic at Persistent, told a long message with an interesting story will be marked as higher quality, while a short, congratulatory note will be scored lower. Because so many viewers write in “Hinglish,” a combination of Hindi and English, an off-the-shelf system wouldn't have been as accurate for processing these messages.
In the future, he’d like to train the system to recognize various gradients of emotion, too, beyond just simple sentiment. That means not just “positive” or “negative,” but also “happy,” “sad,” “angry” and any other way a viewer might be feeling. The best messages are then sent to a team of trained analysts — often college students and graduates, along with some Persistent employees — who decide which ones are worth following up on for a Friday radio show Khan does, and for placement on Satyamev Jayate’s web site. These analysts try to ensure that the stories shared are truthful and that the messages don’t contain personal information that could get viewers in trouble or affect their privacy. Data visualizations about how many people have responded and where they come from is available on the Impact section of the show’s site, as well as on separate Impact pages for each episode.
Making a difference with data
All this feedback has an impact, both on the show itself and on India. Satyamev Jayate’s voting process, in particular, has yielded some impressive results. After the first episode about female feticide, or the selective abortion of female fetuses, 99.8 percent of viewers said they agreed with the idea of a fast-track court to prosecute doctors who perform such operations. When Khan presented the results to the Indian government, officials agreed almost immediatelyto amend the court system accordingly, the producer told me.
Sometimes, though, the results simply present an interesting — if not troubling — view into the Indian subconscious. Almost 32 percent of respondents, for example, voted in favor of the right of families to use force preventing the marriage of two willing adults (subsequent analysis uncovered some reasons why, including continuing opposition to inter-caste marriage), while almost 14 percent of respondents one week said that beating a woman is a sign of masculinity. And although women comprise only about 32 percent of the show’s audience, they have accounted for the majority of responses on shows addressing issues important to them.
The producer said his team also uses the data to inspire ideas for future shows and to populate a weekly radio show that Khan does with a local journalist. The Satyamev Jayate team analyzes the week’s messages in order to pick the most powerful and determine trends in viewers’ feelings, and Khan shares them during the interview. The second season, he said, will be shaped in part by how viewers responded to the format during the first season and the issues they want covered next.
However, the producer seems perfectly content right now with the way Satyamev Jayate is resonating with the public. For example, he said, viewers are reporting crimes they previously might not have considered too big a deal and are reaching out to disabled citizens. This is the first time many people are speaking openly about these issues, he said, and they’re able to track the effects because they’re able to ensure no message is left behind.
No comments:
Post a Comment