traderworld 发表于 2012-2-13 19:53:14

美国政府寻求通过特殊软件监控整个社交网站

FBI seeks digital tool to mine entire universe of social media By MARCUS WOHLSEN Associated Press February 12, 2012 7:52PM

[*]




Updated: February 13, 2012 2:10AM

SAN FRANCISCO — The U.S. government is seeking software that can mine social media to predict everything from future terrorist attacks to foreign uprisings, according to requests posted online by federal law enforcement and intelligence agencies.Hundreds of intelligence analysts already sift overseas Twitter and Facebook posts to track events such as the Arab Spring. But in a formal “request for information” from potential contractors, the FBI recently outlined its desire for a digital tool to scan the entire universe of social media — more data than humans could ever crunch.The Department of Defense and the Office of the Director of National Intelligence also have solicited the private sector for ways to automate the process of identifying emerging threats and upheavals using the billions of posts people around the world share every day.“Social media has emerged to be the first instance of communication about a crisis, trumping traditional first responders that included police, firefighters, EMT, and journalists,” the FBI wrote in its request. “Social media is rivaling 911 services in crisis response and reporting.”The proposals already have raised privacy concerns among advocates who worry that such monitoring efforts could have a chilling effect on users. Ginger McCall, director of the open government project at the Washington, D.C.-based Electronic Privacy Information Center, said the FBI has no business monitoring legitimate free speech without a narrow, targeted law enforcement purpose.“Any time that you have to worry about the federal government following you around peering over your shoulder listening to what you’re saying, it’s going to affect the way you speak and the way that you act,” McCall said.The FBI said in a statement to The Associated Press that their proposed system is only meant to monitor publicly available information and would not focus on specific individuals or groups but on words related to criminal activity.Analyzing public information is nothing new in the world of intelligence. During the Cold War, for example, CIA operatives read Russian newspapers and intercepted television and radio broadcasts in hopes of inferring what Soviet leaders were thinking.But the rise of social media over the past few years has dramatically changed both the kinds and amount of freely available information. For example, Twitter CEO Dick Costolo said at a recent conference that users of the micro-blogging service send out an average of one billion tweets every three days.“It really ought to be the golden age of intelligence collection in that you’ve got people falling all over themselves trying to express who they are,” said Ross Stapleton-Gray, a former CIA analyst and now a technology consultant who advises companies on security, surveillance and privacy issues.As a staffer in the early 1990s in what later became the Office of the Director of National Intelligence, Stapleton-Gray said the U.S. intelligence community’s early efforts to better harness the increasing volume of information becoming available on the Internet ran into resistance from old hands who believed that secrets were more valuable than the information anyone could get.But agencies’ requests for better social media tools indicate that resistance has wilted.The system sought by the research arm of the national intelligence director’s office would fuse together everything from Web searches to Wikipedia edits to traffic webcams to “beat the news” by predicting major events ranging from economic turmoil to disease outbreaks.The Defense Department’s tool would track social media to identify the spread of information that could affect soldiers in the field and also give the military ways to conduct its own “influence operations” on social networks to counteract enemy campaigns.The intelligence director’s office and the Defense Department said they could not meet the AP’s deadline to answer specific questions about the proposed projects.The FBI is seeking a web app that would automatically scrape social networks for data that could alert the agency’s operations center to breaking crises as they happen and plot them on interfaces like Google MapsFor such systems to work well, their developers would have to overcome several technological challenges, the easiest of which is handling the massive amount of data involved.Developments in so-called “cloud computing” have made processing big data sets easier than ever before by spreading the work broadly across networks of computers.Instead, experts in the field say the major hurdle is in effect teaching computers how to read. To sift the valuable information from the mundane, the software must understand the subtleties of meaning in tweets and blog posts to tell the difference between, for example, a serious statement and a joke.Solving such problems falls to researchers in fields such as natural language processing and computational linguistics — the same specialties that brought the world the iPhone’s Siri voice-activated assistant and IBM’s Watson, which trounced its human opponents at Jeopardy.San Francisco-based Linguastat Inc. worked with the Centers for Disease Control during the 2009 swine flu outbreak to track public fears and concerns on social networks and determine whether the CDC’s public health messages were gaining traction. Company co-founder John Pierre said that tracking public sentiment depended on much more than searching social media for specific words or phrases.“Just because they mention it, do they like it, do they not, are they saying it in the right context? Is it a band called The Swine Flu?” Pierre said.Authenticity also becomes an issue in analyzing social networks. Computer programs known as “bots” already plague services such as Twitter with junk posts similar to email spam. Researcher Tim Hwang has scripted his own bots to see how much influence they could wield over social networks and says the ability to create bots that closely mimic humans will only improve over time.This matters in intelligence gathering because bots could fool analysts — and their software — into thinking they’re witnessing a genuine shift in social trends that in reality could be a government propaganda campaign driven by, for example, Twitter users that don’t really exist.“We have all the data. How do we know what’s real and what’s not?” Hwang said.William McCants, an analyst at the Center for Naval Analyses and a former State Department official, monitors al-Qaeda propaganda online. He said he worries that the systems the FBI and other agencies are seeking could create an overreliance on technology at the expense of carefully trained human analysts who are still better at zeroing in on the facts that matter most.“The more data you use and the more complicated the software, the more likely it is you will confirm a well-known banality,” McCants said a friend likes to joke. “You didn’t need to be on Twitter to know that a revolution was happening in Egypt.”APCopyright 2012 Associated Press. All rights reserved. This material may not be published, broadcast, rewritten, or redistributed.

草帽飞了 发表于 2012-2-13 22:35:23

大洋两岸的网警看谁牛了。

psax 发表于 2012-2-14 12:51:04

声称是能用来发现恐怖分子。哪位懂行的能讲讲这社交网络数据挖掘有啥核心技术?非死不可那种腻在一起瞎聊的网站怎么挖掘出有用的东西?

netsouth 发表于 2012-2-14 14:35:27

psax 发表于 2012-2-14 12:51 static/image/common/back.gif
声称是能用来发现恐怖分子。哪位懂行的能讲讲这社交网络数据挖掘有啥核心技术?非死不可那种腻在一起瞎聊的 ...

前段时间不是刚有新闻, 说英国有个人在facebook上说要炸了美国, 结果去美国旅游时一下飞机就被逮捕了...

鱼儿汤 发表于 2012-2-14 21:34:34

即使技术上做得到,也需要大量的资源投入才能执行下来吧?

草帽飞了 发表于 2012-2-16 14:37:18

记得原来河里一个帖子,写的是纽约和伦敦的部分街头安装了音视频采集矩阵,后台是海量数据处理和挖掘分析、人工智能,有专门的警察盯着,数据开放给情报部门。估计非死不可、谷歌之类早就顺从了CIA FBI把数据库接口开放了吧。中东去年出的那些妖蛾子,不少是通过非死不可煽风,推特来煽风点火的。

莫飞 发表于 2012-2-19 09:19:00

psax 发表于 2012-2-14 12:51 static/image/common/back.gif
声称是能用来发现恐怖分子。哪位懂行的能讲讲这社交网络数据挖掘有啥核心技术?非死不可那种腻在一起瞎聊的 ...

最近几年真的有社交网络挖掘的成功例子
1. 日本研究证明,用twitter上进行地震预警比国家地震局发出的警报还要快
2. 非洲中东的暴乱,大多数都是年轻人先在社交网络上进行串联组织然后才到现实中进行游行的

当然说这里边的技术么,基本上都是瞎折腾,入门门槛很低

闻到阳光 发表于 2012-2-19 09:51:36

电子技术的发展成了统治的有力武器了

lafewu 发表于 2012-2-19 10:13:40

没什么太大的技术投入,主要的技术构建都已经成熟并且已经在大规模商用了。 海量数据挖掘:ebay, amazon,facebook
语音识别:IBM的技术很早60年代就有; 语义分析,核心敏感字分析: google,多点数据采集:这个华为也能干:D. 估计是不是通过GE协调了一把,这个系统就成了?:D 无非就是弄点廉价的PC server而已。

草帽飞了 发表于 2012-2-23 21:08:18

TG说不定早就干上了,哈哈。
页: [1]
查看完整版本: 美国政府寻求通过特殊软件监控整个社交网站