<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Umbrella Security Labs</title>
	<atom:link href="http://labs.umbrella.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://labs.umbrella.com</link>
	<description>Just another WordPress site</description>
	<lastBuildDate>Fri, 24 May 2013 17:35:46 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5.1</generator>
		<item>
		<title>Big Data Driven Security with Splunk</title>
		<link>http://labs.umbrella.com/2013/05/24/big-data-driven-security-with-splunk/</link>
		<comments>http://labs.umbrella.com/2013/05/24/big-data-driven-security-with-splunk/#comments</comments>
		<pubDate>Fri, 24 May 2013 17:35:46 +0000</pubDate>
		<dc:creator>Ping</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Innovation]]></category>

		<guid isPermaLink="false">/?p=5392</guid>
		<description><![CDATA[<p>In order to deliver predictive threat protection to our customers, the Umbrella Security Labs research team has to collect and correlate data from various sources in innovative ways. We&#8217;ve shared in previous posts how our team applies proprietary algorithms to data from the OpenDNS Global Network, but we&#8217;re constantly on the hunt for  easy-to-use data platforms [...]</p><p>The post <a href="http://labs.umbrella.com/2013/05/24/big-data-driven-security-with-splunk/">Big Data Driven Security with Splunk</a> appeared first on <a href="http://labs.umbrella.com">Umbrella Security Labs</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>In order to deliver predictive threat protection to our customers, the Umbrella Security Labs research team has to collect and correlate data from various sources in innovative ways. We&#8217;ve shared in previous posts how our team applies <a title="Umbrella Security Graph" href="/security-graph/">proprietary algorithms</a> to data from the <a title="Global Network" href="/global-network/">OpenDNS Global Network</a>, but we&#8217;re constantly on the hunt for  easy-to-use data platforms that allow for real-time and interactive data visibility. </p>
<p>That&#8217;s why we wanted to share a bit about our experience with Splunk, a big data management system that provides fast machine data parsing, indexing, searching and data analyses. The GUI interface, dashboard and availability of security-related add-ons make for a neat out-of-the-box solution for enhanced data visibility. </p>
<p><strong>Splunk Basic Usage</strong></p>
<p>Installation of Splunk base is rather straightforward. Check out their <a href="http://docs.splunk.com/Documentation/Splunk/latest/Installation/Whatsinthismanual">official docs</a> for installation instructions. When you&#8217;re getting started, these are some of the basic ways to use Splunk: <em>add data to splunk (data input), search, delete, data aggregation, data transformation, and charting</em>. </p>
<p>If you&#8217;re using customized data, you&#8217;ll likely find input to be the trickiest part. That&#8217;s where Splunk will have to figure out the correct data format, and properly parse it to extract fields. Splunk tries to automatically break the raw blob of textual input into EVENTS based on default or customized event breaking settings, and recognize the timestamp for each event. These settings can be customized both via Splunk GUI or command line interface (CLI). Make changes props.conf file to tell Splunk how to treat your data with correct configurations.</p>
<p>An example of extracting tab delimited fields from my input data: <a style="font-size: 13px;" href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/splunk_propsconf1.png"><img class="alignnone size-medium wp-image-5420" alt="splunk_propsconf" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/splunk_propsconf1-300x94.png" width="300" height="94" /></a></p>
<p>For data queries and other operations (aggregating, data transforming etc.), Splunk’s pipe syntax seems pretty straightforward.</p>
<p>The following query that maps out a number of IP addresses that fits certain criteria serve as a good example of basic query syntaxes. The example requires the geoIP mapping app provided by Maxmind, and amMap, a mapping app. </p>
<pre style="padding-left: 30px;">sourcetype=mute* | rex "(?d+.d+.d+.d+)"| search ip!=192.168* ip!=0.0.* ip!=10.*|  stats count by ip | eval count_label="Event" | eval iterator="ip" | eval iterator_label="IP" |  eval zoom = "zoom="334%" zoom_x="-128.58%" zoom_y="-113.11%""| eval movie_color="#FF0000" | eval output_file="home_threat_data.xml" | eval app="amMap" | lookup geoip clientip as ip | search client_country!=^$ | mapit</pre>
<p><strong>Splunk data forwarding and receiving </strong></p>
<p>Install the <a href="http://splunk-base.splunk.com/answers/50082/how-do-i-configure-a-splunk-forwarder-on-linux">universal forwarder</a> if your have remote data. The universal forwarder gathers data from servers where your input data reside and forwards them to your main Splunk server for indexing and searching. </p>
<pre style="padding-left: 30px;">./splunk add forward-server [splunk server:port]</pre>
<pre style="padding-left: 30px;">/opt/splunkforwarder/bin/splunk add monitor /path/to/app/logs/ -index main -sourcetype %app%</pre>
<p>At the same time, enable receiver &#8211; the main Splunk server and indexer by going to Splunk GUI, in forwarding and receiving-&gt;add new -&gt; TCP port [port]</p>
<p>To troubleshoot the deployment, check these internal logs at the receiving indexer:</p>
<pre style="padding-left: 30px;">$SPLUNK_HOME/var/log/splunk/splunkd.log</pre>
<pre style="padding-left: 30px;">$SPLUNK_HOME/var/log/splunk/license_audit.log</pre>
<p><strong>Use cases for Splunk security apps </strong></p>
<p>Splunk base has a set of charting choices. In the following example, we made a pie chart of user agent distribution of our mobile clients data. </p>
<p><a href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/splunk_pie1.png"><img class="alignnone  wp-image-5432" alt="splunk_pie1" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/splunk_pie1.png" width=" 421" height="247" /></a></p>
<p>Snort app has been a great tool for quick network threat monitoring and alerting. We can easily retrieve all the entries that triggered snort, and perform in-depth investigations given the source IP addresses and contextual network data. </p>
<h5><a href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/splunk_snort_stats.png"><img alt="splunk_snort_stats" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/splunk_snort_stats.png" width="421" height="254" /></a><a href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/splunk_snort_stats_2.png"><img class="alignnone  wp-image-5398" alt="splunk_snort_stats_2" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/splunk_snort_stats_2.png" width="420" height="378" /></a> </h5>
<p>Snort and amMap makes use of Maxmind&#8217;s geo-ip mapping to give us an instant global look at the threat&#8217;s scale and spreading patterns.  </p>
<p> <a href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/snort_map_source_ip.png"><img alt="snort_map_source_ip" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/snort_map_source_ip.png" width="421" height="258" /></a></p>
<h5>Conclusion</h5>
<p>We have yet to explore Splunk&#8217;s other interesting capabilities, such as real-time correlation making and alerting, or its distributed system deployment scheme (with Hadoop integration). We’ve spent lots of time with Hadoop and Hbase, which are largely back-end systems. As far as our primitive use of Splunk goes, it seems to serve quite well as a front-end portal for internal search, query and reporting. Data parsing for customized data is not as intuitive. It would be great if it provided pipe-like syntax for data input, as well. </p>
<p>The post <a href="http://labs.umbrella.com/2013/05/24/big-data-driven-security-with-splunk/">Big Data Driven Security with Splunk</a> appeared first on <a href="http://labs.umbrella.com">Umbrella Security Labs</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://labs.umbrella.com/2013/05/24/big-data-driven-security-with-splunk/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Announcing the Security Community Forums</title>
		<link>http://labs.umbrella.com/2013/05/20/announcing-security-community-forums/</link>
		<comments>http://labs.umbrella.com/2013/05/20/announcing-security-community-forums/#comments</comments>
		<pubDate>Mon, 20 May 2013 16:47:18 +0000</pubDate>
		<dc:creator>Vinny</dc:creator>
				<category><![CDATA[Security Community]]></category>
		<category><![CDATA[Security Research]]></category>
		<category><![CDATA[Umbrella Security Graph]]></category>

		<guid isPermaLink="false">/?p=5325</guid>
		<description><![CDATA[<p>Community has always been a priority at OpenDNS, so we’re thrilled to announce that the new Umbrella Security Community Forums are now online. We created the forums so our community had a central place to discuss new threats appearing on the landscape, malware samples, security research, and the Umbrella Security Community review process. The top [...]</p><p>The post <a href="http://labs.umbrella.com/2013/05/20/announcing-security-community-forums/">Announcing the Security Community Forums</a> appeared first on <a href="http://labs.umbrella.com">Umbrella Security Labs</a>.</p>]]></description>
				<content:encoded><![CDATA[<p dir="ltr"><a title="Security Community and Partners" href="/icon-list-item/security-community-and-partners/">Community</a> has always been a priority at OpenDNS, so we’re thrilled to announce that the new Umbrella Security Community Forums are now online. We created the forums so our community had a central place to discuss new threats appearing on the landscape, malware samples, security research, and the Umbrella Security Community review process.</p>
<p><a href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/Screen-Shot-Forums.jpg"><img class="aligncenter size-full wp-image-5327" alt="Screen Shot Forums" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/Screen-Shot-Forums.jpg" width="600" height="241" /></a></p>
<p>The top contributors of the community also get access to the <a href="http://labs.umbrella.com/security-graph/">Umbrella Security Graph</a>, OpenDNS’s proprietary research tool that provides the <a title="About Umbrella Security Labs" href="/about-us/">Umbrella Security Labs</a> research team with specific information about domains and allows us to review sites that are going to host malware, bot networks, and phishing before they actually become malicious. The data is sourced from the 50+ billion DNS requests OpenDNS receives each day from 50 million customers in more than 150 countries.</p>
<p>We’re still searching for people to join our passionate security community, so if this sounds like something you would be interested in, then we would love to have you! Join members in Australia, India, Africa, UK, Switzerland, Sweden, Canada and in various parts of the US &#8211; just let us know <a href="http://community.opendns.com/domaintagging/malware_application.php">what caught your eye</a>.</p>
<p>The post <a href="http://labs.umbrella.com/2013/05/20/announcing-security-community-forums/">Announcing the Security Community Forums</a> appeared first on <a href="http://labs.umbrella.com">Umbrella Security Labs</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://labs.umbrella.com/2013/05/20/announcing-security-community-forums/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Details on exploit kits, as told by the Umbrella Security Graph</title>
		<link>http://labs.umbrella.com/2013/05/17/a-quick-look-at-domains-used-for-exploit-kits/</link>
		<comments>http://labs.umbrella.com/2013/05/17/a-quick-look-at-domains-used-for-exploit-kits/#comments</comments>
		<pubDate>Fri, 17 May 2013 16:17:02 +0000</pubDate>
		<dc:creator>Dhia</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Security Research]]></category>
		<category><![CDATA[threat discovery]]></category>
		<category><![CDATA[Umbrella Security Graph]]></category>

		<guid isPermaLink="false">/?p=4877</guid>
		<description><![CDATA[<p>The Umbrella Security Labs combines our proprietary research tool the Umbrella Security Graph (Sgraph) with various investigative methods and backend predictive algorithms and classifiers to uncover new sets of suspicious and malicious domains each day. We leverage these technologies to discover domains before they are used in the wild, with the goal of ensuring that customers [...]</p><p>The post <a href="http://labs.umbrella.com/2013/05/17/a-quick-look-at-domains-used-for-exploit-kits/">Details on exploit kits, as told by the Umbrella Security Graph</a> appeared first on <a href="http://labs.umbrella.com">Umbrella Security Labs</a>.</p>]]></description>
				<content:encoded><![CDATA[<p dir="ltr">The Umbrella Security Labs combines our proprietary research tool the <a title="Umbrella Security Graph" href="/security-graph/">Umbrella Security Graph</a> (Sgraph) with various investigative methods and backend predictive algorithms and classifiers to uncover new sets of suspicious and malicious domains each day. We leverage these technologies to discover domains <em>before</em> they are used in the wild, with the goal of ensuring that customers using <a href="http://www.umbrella.com">Umbrella by OpenDNS</a> receive the best protection possible. But given the lofty nature of this claim, we also seek to expose our research and findings so that it&#8217;s clear how we accomplish these goals. For example, in a previous <a href="http://labs.umbrella.com/2013/04/15/on-the-trail-of-malicious-dynamic-dns-domains/">post</a>, we explored the relationships between dynamic DNS domains and malware sites. In this blog, we look at sample malicious URLs to determine the most frequently used exploit kits, and discuss some of the notable features of these URLs&#8217; domains.</p>
<p dir="ltr"><strong>An Overview of Exploit Kits</strong></p>
<p dir="ltr">Exploit kits are an extremely efficient and popular way of distributing malware to gain control of people’s computers. Let&#8217;s quickly review how exploit kits work: First, a user’s traffic is redirected to a server that runs an exploit kit Web application. Then, the kit detects existing vulnerabilities in the software or operating system running on the user’s machine. The kit exploits these vulnerabilities by stealthily dropping a malware payload (directly or via further redirections) on the user’s computer. As a result, the attacker takes over the user’s machine. The ecosystem of exploit kits is quite vast, and these kits can differ technically or with regard to their business model. A good poster showing the common exploit kits of 2012 is available on deependresearch.org <a href="http://www.deependresearch.org/2012/11/common-exploit-kits-2012-poster.html">site</a>.</p>
<p dir="ltr" style="text-align: center;"><a href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/1780x1200_CommonExploitPacks2012u18.jpg"><img class=" wp-image-4911 aligncenter" alt="1780x1200_CommonExploitPacks2012u18" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/1780x1200_CommonExploitPacks2012u18.jpg" width="538" height="363" /></a></p>
<p>[Exploit kits poster published on http://www.deependresearch.org/2012/11/common-exploit-kits-2012-poster.html]</p>
<p dir="ltr"><strong>An Umbrella Security Labs Exploit Kit Study</strong></p>
<p>To perform a study on how exploit kits are behaving, we take random samples of malicious URLs from our database for a period of 7 days, and use Yara (more on the tool at the end of this post) rules to detect known exploit kits URLs and report the percentage of URLs for each exploit kit. We also analyze the domains of the URLs for any noticeable features.</p>
<p style="text-align: center;"><a href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/top-exploit-kits-sample.png"><img class=" wp-image-4879 aligncenter" alt="top-exploit-kits-sample" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/top-exploit-kits-sample.png" width="523" height="315" /></a></p>
<p dir="ltr">In the figure above, we see the top 3 most frequent exploit kits in the URL samples. As expected, Blackhole is the most prevalent followed by Red kit, then Safepack.</p>
<p dir="ltr">We then take as an example, the URLs that are related to Blackhole and further investigate their domains using the <a title="Umbrella Security Graph" href="/security-graph/">Umbrella Security Graph</a> (Sgraph). For the example domain dfudont[.]ru, Sgraph shows in the figure below that it is fastflux, hosted on an ASN with a suspicious score, has a low C-Rank and that the domain has been assigned a low score by our classifier.</p>
<p dir="ltr" style="text-align: center;"><a href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/bh-ff-sgraph.png"><img class=" wp-image-4881 aligncenter" alt="bh-ff-sgraph" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/bh-ff-sgraph.png" width="584" height="838" /></a></p>
<p dir="ltr"><strong>Details on a Sample of Blackhole Domains</strong></p>
<p dir="ltr">Domains such as jindalo[.]ru, ijsiokolo[.]ru, ifikangloo[.]ru, ejjiipprr[.]ru, and dfudont[.]ru, are detected as fastflux and registered in January-April 2013. One of our <a href="http://labs.umbrella.com/2013/02/13/kelihos-in-its-third-incarnation/">predictive algorithms</a> detects fastflux domains daily. These domains are often registered recently then start triggering DNS traffic. </p>
<p dir="ltr">The domain ehrap[.]net is registered on May 2nd, and stays dormant for 5 days. It starts being DNS-active on May 7 and our fastflux detection system catches it on that day, which is also the same day it begins delivering malware in a <a href="http://blog.dynamoo.com/2013/05/amazoncom-spam-ehrapnet.html">fake Amazon spam campaign</a>. On May 8, we catch the domain nilokwe[.]pw as fastflux, which is the <a href="http://registry.pw/whois/?query=nilokwe.pw&amp;output=nice">same day it is registered</a>. On May 9, we detect the domain pinformer[.]net as fastflux. The three domains <a href="http://urlquery.net/report.php?id=2384069">ehrap[.]net</a>, <a href="http://urlquery.net/report.php?id=2393671">nilokwe[.]pw</a>, <a href="http://urlquery.net/report.php?id=2398756">pinformer[.]net</a> are all Blackhole related domains as urlquery reports it (click on the domain name to see the urlquery report). Furthermore, as of May 8, the three domains ehrap[.]net, nilokwe[.]pw, and pinformer[.]net use the same name server(s): ns1[.]recorderbooks[.]net and ns2[.]recorderbooks[.]net. ns[1-2].recorderbooks[.]net also serve as name servers for nopfrog[.]pw. This domain was registered on <a href="http://registry.pw/whois/?query=nopfrog.pw&amp;output=nice">May 4</a>, stayed dormant, then started activity as a Blackhole domain on May 9-10 (<a href="http://urlquery.net/report.php?id=2410210">urlquery report</a>). We spotted it at the same time via its name server association with fastflux Blackhole domains.</p>
<p dir="ltr">justcollega[.]net is hosted on 37.59.215.18. This IP is hosting several other known malicious domains, and others not yet flagged as malicious. Some of the latter domains were registered in early May, e.g. ksufsdkjvbdskvsdkvsdv[.]com on May 2, justcollega[.]com on May 4, burnbug[.]net on May 5, zohoretail[.]biz on May 5, and contasesso[.]net on May 6. These domains are suspicious and justify that we block them or quarantine them.</p>
<p dir="ltr">Several known Blackhole related domains are hosted on the same IPs, as the figure below shows (from the studied sample). One such IPs is 37.230.116.89. This IP hosts other domains, and similarly several are registered recently. For example, kawsedrol[.]us, kawsedrol[.]info, kawsedrol[.]biz, qertroli[.]us, qertroli[.]info, polkawsed[.]org, polkawsed[.]info are all registered on May 1st 2013. Some of these domains already have known subdomains hosting or redirecting to malware. This justifies that we block the 2LDs.</p>
<p dir="ltr"><a href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/bh-doms-clustered.png"><img class="wp-image-4883 alignnone" alt="bh-doms-clustered" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/bh-doms-clustered.png" width="696" height="320" /></a></p>
<p><strong>More Details on Yara, the Tool for Our Study</strong></p>
<p dir="ltr"><a href="https://code.google.com/p/yara-project/">Yara</a> is a malware identification and classification tool that is fast at detecting textual patterns in URL samples. One can use known regular expressions of exploit kits&#8217; URLs as rules and fire up the Yara engine to see which URLs match the rules. For example, a known string rule for detecting certain Blackhole URLs looks like:</p>
<p>rule blackhole : exploit_kit<br />{<br />    strings:<br />        $a = /.php?.*?:[a-zA-Z0-9:]{6,}&amp;.*?&amp;/<br />    condition:<br />        $a<br />}</p>
<p dir="ltr">Another rule for detecting certain Red kit URLs used for dropping malware payloads looks like:</p>
<p>rule redkit_bin : exploit_kit<br />{<br />    strings:<br />        $a = //d{2}.htmls/<br />    condition:<br />        $a<br />}</p>
<p>&nbsp;</p>
<p>The post <a href="http://labs.umbrella.com/2013/05/17/a-quick-look-at-domains-used-for-exploit-kits/">Details on exploit kits, as told by the Umbrella Security Graph</a> appeared first on <a href="http://labs.umbrella.com">Umbrella Security Labs</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://labs.umbrella.com/2013/05/17/a-quick-look-at-domains-used-for-exploit-kits/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>D3 + MV* Framework Visualization Meetup</title>
		<link>http://labs.umbrella.com/2013/05/13/d3-mv-framework-visualization-meetup/</link>
		<comments>http://labs.umbrella.com/2013/05/13/d3-mv-framework-visualization-meetup/#comments</comments>
		<pubDate>Mon, 13 May 2013 20:15:02 +0000</pubDate>
		<dc:creator>Ping</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Open Source and Tools]]></category>
		<category><![CDATA[Security Community]]></category>
		<category><![CDATA[Security Research]]></category>

		<guid isPermaLink="false">/?p=5079</guid>
		<description><![CDATA[<p>Umbrella Security Lab researchers consider visualization and design the key to data driven security intelligence. In a recent post, OpenDNS CTO Dan Hubbard shared insight on evolving security Venn and the role security visualization plays in the shift.  Below is a look at how our team handles almost any data exploration process.  We can look [...]</p><p>The post <a href="http://labs.umbrella.com/2013/05/13/d3-mv-framework-visualization-meetup/">D3 + MV* Framework Visualization Meetup</a> appeared first on <a href="http://labs.umbrella.com">Umbrella Security Labs</a>.</p>]]></description>
				<content:encoded><![CDATA[<p><a href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/d3.png"><br /></a>Umbrella Security Lab researchers consider visualization and design the key to data driven security intelligence. In a recent <a href="http://labs.umbrella.com/2013/04/22/evolution-of-the-security-venn/">post</a>, OpenDNS CTO Dan Hubbard shared insight on evolving security Venn and the role security visualization plays in the shift.  Below is a look at how our team handles almost any data exploration process. <a style="font-size: 13px;" href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/dataexpl.png"><img class="wp-image-5085 alignnone" alt="dataexpl" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/dataexpl.png" width="440" height="104" /></a></p>
<p>We can look at data from <a title="Breaking news: Traffic from Syria Disappears from Internet" href="/2013/05/07/breaking-news-traffic-from-syria-disappears-from-internet/">Syria&#8217;s most recent Internet blackout</a> for further evidence on how telling a graph can be. Below are DNS traffic plots we made using mathematica. The first plot shows the hours before and after Syria&#8217;s went offline. The second shows when Syria came back online the next day. </p>
<p><a href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/syria-before1.png"><img class="wp-image-5093 alignleft" alt="syria-before" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/syria-before1.png" width="253" height="174" /></a></p>
<p><a style="font-size: 13px;" href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/syria-after1.png"><img alt="syria-after" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/syria-after1.png" width="270" height="174" /></a></p>
<p>&nbsp;</p>
<p>There are tons of exciting technologies that can be used for visualizing data. D3, Angular, Amber, and backbone.js are just a few. We use Angular and Highcharts in the <a title="Umbrella Security Graph" href="/security-graph/">Umbrella Security Graph</a>, and we&#8217;re using Maltego and amMap, ggplot2 (R), and OpenGL as well. But there are still many questions to be answered: </p>
<p><i>What are the coolest visualization examples we can draw ideas from? </i></p>
<p><i>Which of these techniques make a good stack for use out-of-the-box? </i></p>
<p><i>What does visualization mean for big data? </i></p>
<p><i>Which chart types can reveal patterns for discovery, rather than just being eye candy?  </i></p>
<p><i>What techniques allow us to interact with a graph, exploring the rich dimensions of our data? </i></p>
<p><i> </i></p>
<p><a style="font-size: 13px;" href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/d3.png"><img class="alignleft" alt="d3" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/d3.png" width="206" height="125" /></a>To try and answer some of these questions, we invited the experts from Bay Area D3 User Group to the OpenDNS headquarters.</p>
<p>Miles McCrocklin [<a title="Miles McCrocklin" href="http://www.youtube.com/watch?v=Hd2rye9a9kk">video</a>, <a href="http://bl.ocks.org/milroc/raw/5553051/#0">slides</a>] and Chris Viau [<a title="Christoph D3" href="http://www.youtube.com/watch?v=EoAY0hArT80">video</a>, <a href="https://docs.google.com/presentation/d/1YMwsyl0r1XXMS-V1ZWXgpPApFf4ZqEbLbt_P-Vwvv1k/edit?pli=1">slides</a>] each presented a different-yet-unified focus on reusable D3 visualizations, providing a very good collection of <a href="http://bl.ocks.org/milroc/raw/5553051/#46">resources</a>.  Jyri Tuulos [<a title="D3 + Backbone" href="http://www.youtube.com/watch?v=mS4EgJvy1aQ">video</a>, <a href="http://jtuulos.github.io/bayd3-may2013/">slides</a>] covered how to build object-oriented D3 charts using Views in Backbone.js, including structuring the code, the distribution of responsibilities between the libraries, and extending chart classes from a single base class.</p>
<p><a href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/photo-19.jpg"><img class="alignnone  wp-image-5109" alt="photo (19)" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/photo-19.jpg" width="346" height="259" /></a></p>
<p><a href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/photo-4.jpg"><img class=" wp-image-5107 alignnone" alt="photo 4" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/photo-4.jpg" width="346" height="259" /></a></p>
<p>If all this talk of Big Data leaves you wanting to roll up your sleeves and do some data mining of your own, we&#8217;ve got a great place for you to get started. Below you&#8217;ll find the BGP data we observed during the Syrian Internet Blackout. Can&#8217;t wait to hear more about what you discover. </p>
<p>[download the data <a href="http://portal.bgpmon.net/data/bgpmon-82.137.192.0-18.txt">link1</a> <a href="http://portal.bgpmon.net/data/bgpmon-90.153.128.0-17.txt">link2</a>] [<a href="http://pastebin.com/sn4kLrFU">data description</a>] </p>
<p>&nbsp;</p>
<p>The post <a href="http://labs.umbrella.com/2013/05/13/d3-mv-framework-visualization-meetup/">D3 + MV* Framework Visualization Meetup</a> appeared first on <a href="http://labs.umbrella.com">Umbrella Security Labs</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://labs.umbrella.com/2013/05/13/d3-mv-framework-visualization-meetup/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Why real-time detection of compromised hosts has become critical</title>
		<link>http://labs.umbrella.com/2013/05/10/why-real-time-detection-of-compromised-hosts-has-become-critical/</link>
		<comments>http://labs.umbrella.com/2013/05/10/why-real-time-detection-of-compromised-hosts-has-become-critical/#comments</comments>
		<pubDate>Fri, 10 May 2013 18:19:17 +0000</pubDate>
		<dc:creator>Frank Denis</dc:creator>
				<category><![CDATA[Security Research]]></category>
		<category><![CDATA[threat discovery]]></category>
		<category><![CDATA[scam]]></category>
		<category><![CDATA[spam]]></category>

		<guid isPermaLink="false">/?p=4573</guid>
		<description><![CDATA[<p>A paradigm shift Newly registered domains. Pseudorandom names. Short TTLs. A myriad of IPs spread over unrelated ASNs, most of them being already well known for hosting malicious content. These are strong indicators, among others, that a domain is very likely to be malicious. Our team has been using algorithms that can automatically spot these malicious [...]</p><p>The post <a href="http://labs.umbrella.com/2013/05/10/why-real-time-detection-of-compromised-hosts-has-become-critical/">Why real-time detection of compromised hosts has become critical</a> appeared first on <a href="http://labs.umbrella.com">Umbrella Security Labs</a>.</p>]]></description>
				<content:encoded><![CDATA[<h1>A paradigm shift</h1>
<p>Newly registered domains. Pseudorandom names. Short TTLs. A myriad of IPs spread over unrelated ASNs, most of them being already well known for hosting malicious content. These are strong indicators, among others, that a domain is very likely to be malicious. Our team has been using algorithms that can automatically spot these malicious trends and block them for our customers.  </p>
<p>However, the security industry is currently observing a significant paradigm shift. Spammers, scammers and malware authors are now massively abusing compromised machines in order to operate their business. Hours after the Boston Marathon bombing tragedy, a spam campaign drove recipients to a web page containing actual videos of the explosion. What site visitors didn&#8217;t know is that the page also contained a malicious iframe exploiting a recent vulnerability in Java that would download and install the Kelihos trojan.</p>
<p>Here are some of the infectors serving the malicious Java bytecode:</p>
<pre>kentuckyautoexchange.com</pre>
<pre>infoland.xtrastudio.com</pre>
<pre>aandjlandscapecreations.com</pre>
<pre>detectorspecials.com</pre>
<pre>incasi.xtrastudio.com</pre>
<pre>sylaw.net</pre>
<pre>franklincotn.us</pre>
<pre>earart.com</pre>
<pre>bigbendrivertours.com</pre>
<pre>aeroimageworks.com</pre>
<pre>winerackcellar.com</pre>
<p>In order to evade most blacklists, the iframe in the video web page was switching to a new URL roughly every hour. Around the same time, other spam campaigns led to a similar page. Thousands of web sites were involved. We quickly set up scripts to monitor these web pages and block newly discovered domains in real time.</p>
<p>What all of these websites have in common is that they were not malicious. These were totally benign web sites, established for a long time, with a decent security track record and no distinctive network features. But in the blink of an eye they were compromised, and started infecting thousands of machines with malware. (Running software known to have well-known vulnerabilities didn&#8217;t help.)</p>
<p>A compromised host is a powerful weapon for malware authors. Having full control of a system makes it easy to serve different content according to the web browser, referrer, time or other criteria. The code can be updated anytime, in order to ship repacked versions or download data from different hosts.</p>
<p>Furthermore, backdoors like <a href="http://blog.sucuri.net/2013/04/apache-binary-backdoors-on-cpanel-based-servers.html" target="_blank">Darkleech</a> can be planted and stay under the radar for a very long time for further malicious activities.</p>
<p>For this reason, detecting compromised hosts as soon as possible has become a critical research topic for our team.</p>
<h1>Traffic spikes as an indicator of compromise</h1>
<p>Anomaly detection is a way to spot some domain names we want to take a closer look at. In particular, we are keeping track of domain names seeing a sudden increase of traffic.</p>
<p style="text-align: center;"><a href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/Screen-Shot-2013-05-08-at-5.10.15-PM.png"><img class="aligncenter size-full wp-image-4819" alt="Spike" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/Screen-Shot-2013-05-08-at-5.10.15-PM.png" width="550" /></a></p>
<p>There are many legitimate reasons for such a spike to happen, for example a company may be sending a newsletter.  </p>
<p>While being virtually useless as a defense against spam, a lot of mail transfer agents are doing preliminary checks on incoming email.</p>
<p> Whenever an email whose sender is <strong>example.com</strong> is received, these servers check that valid DNS MX or A records for <strong>example.com</strong> are present. They also frequently check that related PTR records map to one of the found IP addresses. TXT queries (for SPF and DomainKey records), as well as checks against DNS-based black lists are also common.</p>
<p>In order to reduce false positives when detecting compromised hosts, we only keep domain names for which we saw no TXT, SPF nor MX queries around the time the spike began. </p>
<p>Another common cause for a spike of traffic is web sites dedicated to specific events.</p>
<p>We use the <a title="Umbrella Security Graph" href="/security-graph/">Umbrella Security Graph</a> to extract three features for domain names observing an abnormal increase of traffic:</p>
<p>- The popularity score, which reflects the number of distinct client IP addresses having looked up a domain name in a short time frame.</p>
<p>- The requester geographic distribution. Benign web sites seeing a spike of traffic after a special event tend to fit our models better than malicious domains.</p>
<p>- The c-rank, which reflects how frequently the domain name has been co-occurring with other domain names known to be malicious. </p>
<p>These features then produce and score classification candidate and updates. During this review, we are looking for obfuscated Javascript code, iframes, and software/modules known to have widely exploited vulnerabilities.</p>
<p>The post <a href="http://labs.umbrella.com/2013/05/10/why-real-time-detection-of-compromised-hosts-has-become-critical/">Why real-time detection of compromised hosts has become critical</a> appeared first on <a href="http://labs.umbrella.com">Umbrella Security Labs</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://labs.umbrella.com/2013/05/10/why-real-time-detection-of-compromised-hosts-has-become-critical/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Breaking news: Traffic from Syria Disappears from Internet</title>
		<link>http://labs.umbrella.com/2013/05/07/breaking-news-traffic-from-syria-disappears-from-internet/</link>
		<comments>http://labs.umbrella.com/2013/05/07/breaking-news-traffic-from-syria-disappears-from-internet/#comments</comments>
		<pubDate>Tue, 07 May 2013 19:38:47 +0000</pubDate>
		<dc:creator>Dan Hubbard, CTO</dc:creator>
				<category><![CDATA[Innovation]]></category>
		<category><![CDATA[Security Community]]></category>
		<category><![CDATA[Security Research]]></category>

		<guid isPermaLink="false">/?p=4741</guid>
		<description><![CDATA[<p>At around 18:45 UTC the OpenDNS resolvers saw a significant drop in traffic from Syria. On closer inspection, it seems Syria has largely disappeared from the Internet. The graph below shows DNS traffic from and to Syria. The drop in both inbound and outbound traffic from Syria is clearly visible. The small amount of outbound traffic [...]</p><p>The post <a href="http://labs.umbrella.com/2013/05/07/breaking-news-traffic-from-syria-disappears-from-internet/">Breaking news: Traffic from Syria Disappears from Internet</a> appeared first on <a href="http://labs.umbrella.com">Umbrella Security Labs</a>.</p>]]></description>
				<content:encoded><![CDATA[<p dir="ltr">At around 18:45 UTC the <a title="OpenDNS" href="http://www.opendns.com">OpenDNS</a> resolvers saw a significant drop in traffic from Syria. On closer inspection, it seems Syria has largely disappeared from the Internet.</p>
<p dir="ltr">The graph below shows DNS traffic from and to Syria. The drop in both inbound and outbound traffic from Syria is clearly visible. The small amount of outbound traffic depicted by the chart indicates our DNS servers trying to reach DNS servers in Syria.</p>
<p dir="ltr" style="text-align: center;"><a href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/syria_offline.png"><img class="aligncenter  wp-image-4775" alt="syria_offline" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/syria_offline-1024x191.png" width="573" height="107" /></a></p>
<p dir="ltr">Currently both TLD servers for Syria, ns1.tld.sy and ns2.tld.sy are unreachable.  The remaining two nameservers sy.cctld.authdns.ripe.net. and pch.anycast.tld.sy. are reachable since they are not within Syria.</p>
<p dir="ltr">Umbrella Security Labs, which is the threat research division of <a title="OpenDNS" href="http://www.opendns.com">OpenDNS</a>, also<a href="http://labs.umbrella.com/2012/12/04/top-10-most-failed-domains-during-syrias-internet-blackout/"> reported</a> on an Internet blackout in Syria November of 2012, where we shared details of the top 10 most failed domains during the outage.  </p>
<p>Expect updates from our team shortly.</p>
<p><strong>Update: 1:28 p.m. PDT</strong></p>
<p dir="ltr">There have been numerous incidents where access to and from the Internet in Syria was shut down. Shutting down Internet access to and from Syria is achieved by withdrawing the BGP routes from Syrian prefixes. The graph below shows the sudden drop in visibility for Syrian network prefixes.</p>
<p dir="ltr" style="text-align: center;"><a href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/umbrella-syria-bgp.png"><img class="aligncenter  wp-image-4783" alt="umbrella-syria-bgp" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/umbrella-syria-bgp-1024x324.png" width="430" height="136" /></a></p>
<p dir="ltr"><strong>How it happened:</strong></p>
<p dir="ltr">Routing on the Internet relies on the Border Gateway Protocol (BGP). BGP distributes routing information and makes sure all routers on the Internet know how to get to a certain IP address. When an IP range becomes unreachable it will be withdrawn from BGP, this informs routers that the IP range is no longer reachable.</p>
<p dir="ltr">For example, one of the name servers for the DNS zone .SY is ns1.tld.sy with IP address 82.137.200.85.</p>
<p dir="ltr">Normally our routers would expect a BGP route for 82.137.192.0/18</p>
<p dir="ltr">Currently that route has disappeared and we no longer have a way to reach the Nameservers for .SY that reside in Syria</p>
<p dir="ltr">andree@rtr1-re0.ams&gt; show route 82.137.192.0/18 detail</p>
<p dir="ltr">{master}</p>
<p dir="ltr">Currently there are just three routes in the BGP routing tables for Syria, while normally it’s close to Eighty.  Below are the routes that are still being announced by the major Syrian Telecom provider: AS29256</p>
<p dir="ltr">andree@rtr1-re0.ams&gt; show route aspath-regex &#8220;.* 29256 &#8220;</p>
<p dir="ltr"><b><b> </b></b></p>
<p dir="ltr">inet.0: 447128 destinations, 1696295 routes (446964 active, 5 holddown, 445714 hidden)</p>
<p dir="ltr">+ = Active Route, &#8211; = Last Active, * = Both</p>
<p dir="ltr"><b><b> </b></b></p>
<p dir="ltr">46.53.0.0/17       *[BGP/170] 01:41:57, MED 0, localpref 100</p>
<p dir="ltr">                     AS path: 3356 3320 29386 29256 I</p>
<p dir="ltr">                   </p>
<p dir="ltr">78.110.96.0/20     *[BGP/170] 01:41:57, MED 0, localpref 100</p>
<p dir="ltr">                     AS path: 3356 3320 29386 29256 I</p>
<p dir="ltr">            </p>
<p dir="ltr">94.141.192.0/19    *[BGP/170] 01:41:57, MED 0, localpref 100</p>
<p dir="ltr">                     AS path: 3356 3320 29386 29256 I</p>
<p dir="ltr">Effectively, the shutdown disconnects Syria from Internet communication with the rest of the world. It’s unclear whether Internet communication within Syria is still available. Although we can’t yet comment on what caused this outage, past incidents were linked to both government-ordered shutdowns and damage to the infrastructure, which included fiber cuts and power outages.</p>
<p>&nbsp;</p>
<p>Update 2:</p>
<p>At 14:12 UTC the <a href="http://www.opendns.com">OpenDNS</a> operations team saw traffic come back online in Syria, after 19 hours and 27 minutes of total time offline. More updates shortly. </p>
<p><a href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/back-online1.png"><img class="alignnone  wp-image-4811" alt="back-online" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/05/back-online1-1024x181.png" width="553" height="98" /></a></p>
<p>The post <a href="http://labs.umbrella.com/2013/05/07/breaking-news-traffic-from-syria-disappears-from-internet/">Breaking news: Traffic from Syria Disappears from Internet</a> appeared first on <a href="http://labs.umbrella.com">Umbrella Security Labs</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://labs.umbrella.com/2013/05/07/breaking-news-traffic-from-syria-disappears-from-internet/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>H2O and Streamdrill Meetup</title>
		<link>http://labs.umbrella.com/2013/04/23/after-work-we-have-our-favorite-spot-to-go-to/</link>
		<comments>http://labs.umbrella.com/2013/04/23/after-work-we-have-our-favorite-spot-to-go-to/#comments</comments>
		<pubDate>Tue, 23 Apr 2013 23:28:35 +0000</pubDate>
		<dc:creator>Ping</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Security Research]]></category>

		<guid isPermaLink="false">/?p=4437</guid>
		<description><![CDATA[<p>Tech meetups are the norm in the San Francisco Bay Area. And for San Diego transplants like me, they are probably the best part of the trade-off from 80 degree weather. The Umbrella Security Labs research team tries to take advantage of the many Big Data and Data Mining meetups held right here in the [...]</p><p>The post <a href="http://labs.umbrella.com/2013/04/23/after-work-we-have-our-favorite-spot-to-go-to/">H2O and Streamdrill Meetup</a> appeared first on <a href="http://labs.umbrella.com">Umbrella Security Labs</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>Tech meetups are the norm in the San Francisco Bay Area. And for San Diego transplants like me, they are probably the best part of the trade-off from 80 degree weather. The Umbrella Security Labs research team tries to take advantage of the many Big Data and Data Mining meetups held right here in the Bay Area, as well as hosting a few right here at the OpenDNS HQ. </p>
<p>A few months ago the Umbrella Security Labs research team <a href="http://www.youtube.com/watch?v=AeITUpKD944">hosted some great speakers</a> from the SF Data Mining group at the OpenDNS HQ. <a href="http://www.meetup.com/Data-Mining">SF Data Mining</a> meet-ups are heavily attended by serious data engineers/scientists, as well as a few amateurs.  The group has been tremendously successful at bringing in big data, machine learning experts, and hosting presentations on the newest techniques, algorithms and products. </p>
<p>Last night we attended another awesome meetup featuring Mikio Braun, who talked about stream mining using streamdrill, and SriSatish Ambati who shared an open source prediction engine called H<sub>2</sub>O. The presentations were so engaging, that we wanted to share the insight here with Umbrella Security Labs blog readers.</p>
<p><a href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/d3.jpg"><img class=" wp-image-4459 alignleft" style="margin-left: 10px; margin-right: 10px;" alt="d3" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/d3.jpg" width="243" height="243" /></a></p>
<p><a href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/h2o.jpg"><img class="wp-image-4451 alignnone" alt="h2o" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/h2o.jpg" width="243" height="243" /></a></p>
<p>&nbsp;</p>
<p>While Hadoop and the mapreduce framework has served us well in hosting terabytes of data, HBASE type of techniques supporting No-SQL data indexing and query, they are batch-processing in essence. Stream processing frameworks were developed to meet the real-time requirements.</p>
<p>Streamdrill provides a streaming solution for solving top-k problems. Top-k is always one of the immediate queries in most analytical systems. It answers queries like “top-x tweets”, “top-y spammers” or “top-z DNS abusers”. More importantly, it has to answer them in real-time, from a surprisingly large dataset. Mikio’s drawing below illustrates the streaming logic well.</p>
<p style="text-align: center;"><a href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/steam.png"><img class=" wp-image-4443 aligncenter" alt="steam" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/steam.png" width="432" height="226" /></a> (source: http://blog.mikiobraun.de)</p>
<p>One of the algorithmic practices Mikio shared is <a href="http://en.wikipedia.org/wiki/Count-Min_sketch">Count-Min Sketch</a>.</p>
<p><a href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/count-min.jpg"><img class=" wp-image-4449 alignleft" style="margin-left: 7px; margin-right: 7px;" alt="count-min" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/count-min.jpg" width="306" height="306" /></a></p>
<p>&nbsp;</p>
<p>The Umbrella Security Lab applies similar techniques in multiple places. In processing the DNS authoritative logs, coming in @ 1G/min (~1.5 Tb/day), we use <a href="https://github.com/jedisct1/bloom-filter/blob/master/bloom.c">bloom filters</a> to remove duplicates which reduce the data down to several millions of unique records per day. </p>
<p>DataFu’s StreamingQuantile algorithm is also used in our security graph system. A simple usage is shown below. I think that our application of Count-Min sketch in detecting traffic spikes calls for a separate blog post to discuss its technical details in full length. =)</p>
<p>DEFINE Quantile datafu.pig.stats.StreamingQuantile(&#8217;0.999&#8242;);</p>
<p> queries_count_per_client = FOREACH (GROUP raw BY client_ip) {</p>
<p>  GENERATE group AS client_ip, COUNT_STAR(raw) AS n;</p>
<p>  } </p>
<p>pctiles = FOREACH (GROUP queries_count_per_client ALL)</p>
<p>  GENERATE Quantile(queries_count_per_client.n).quantile_0 AS pctile99;</p>
<p>top_client_ips = FOREACH (FILTER queries_count_per_client BY n &gt; pctiles.pctile99) {</p>
<p>  GENERATE client_ip;</p>
<p>};</p>
<p>The above code rescales a popularity score based on DNS queries, it removes entries from IPs having sent more queries than 99.9% of other IPs.</p>
<p>We resonate with SriSatish Ambati&#8217;s desire to enable better predictions by making math to scale. His team’s<a href="http://0xdata.github.io/h2o/"> H2O project</a> is made open source, so math, and statistical learning on bigdata is free. We had a chance to try it out, and after less than 30 minutes of setup, here&#8217;s what we see: </p>
<p style="text-align: center;"><a href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/Screen-Shot-2013-04-23-at-3.00.27-PM.png"><img class="wp-image-4455 aligncenter" alt="Screen Shot 2013-04-23 at 3.00.27 PM" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/Screen-Shot-2013-04-23-at-3.00.27-PM.png" width="462" height="497" /></a></p>
<p>Pretty neat! We&#8217;re looking forward to more great big data and data mining meetups in San Francisco, and even hosting some at the OpenDNS headquarters.  When the time comes, we&#8217;ll share those details on our blog.</p>
<div> </div>
<div><span style="font-size: 13px;"> </span></div>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p>The post <a href="http://labs.umbrella.com/2013/04/23/after-work-we-have-our-favorite-spot-to-go-to/">H2O and Streamdrill Meetup</a> appeared first on <a href="http://labs.umbrella.com">Umbrella Security Labs</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://labs.umbrella.com/2013/04/23/after-work-we-have-our-favorite-spot-to-go-to/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Evolution of the Security Venn</title>
		<link>http://labs.umbrella.com/2013/04/22/evolution-of-the-security-venn/</link>
		<comments>http://labs.umbrella.com/2013/04/22/evolution-of-the-security-venn/#comments</comments>
		<pubDate>Mon, 22 Apr 2013 14:02:43 +0000</pubDate>
		<dc:creator>Dan Hubbard, CTO</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Security Research]]></category>

		<guid isPermaLink="false">/?p=4157</guid>
		<description><![CDATA[<p>I have spent a lot of time recently thinking about where information security research still needs to evolve in order to get ahead of the pace of threats and technological disruptions. What’s driving me to rethink the way we approach the problem is both the pace of change and the decreases in efficacy. Security has [...]</p><p>The post <a href="http://labs.umbrella.com/2013/04/22/evolution-of-the-security-venn/">Evolution of the Security Venn</a> appeared first on <a href="http://labs.umbrella.com">Umbrella Security Labs</a>.</p>]]></description>
				<content:encoded><![CDATA[<p dir="ltr">I have spent a lot of time recently thinking about where information security research still needs to evolve in order to get ahead of the pace of threats and technological disruptions. What’s driving me to rethink the way we approach the problem is both the pace of change and the decreases in efficacy. Security has always been described in the terms of balances. There are the balances of efficacy, productivity, scale, performance, and functionality. And on the research side, there has been this balance of science and art. This post takes a look at that balance and what I am calling the “Security Venn of Research”.</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p><strong>The Security Analyst</strong><br /> <a href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/HUBBARD_BLOG_APRIL_2013_3.001.png"><img class="alignright size-medium wp-image-4257" alt="HUBBARD_BLOG_APRIL_2013_3.001" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/HUBBARD_BLOG_APRIL_2013_3.001-300x225.png" width="300" height="225" /></a></p>
<p dir="ltr">In the early days of security research, reverse engineers spent their time receiving samples from customers (even sometimes via snail mail on floppy disks). The researcher would then perform some analysis on the code to determine its intent and behavior, and then update the protection system with some sort of signature. This was a hash, CRC, or some sort of byte pattern. Although there was certainly *some* science applied here the majority of this manual work was done by a person that formed their opinion based on their research and was heavily art-based and little science.</p>
<p>&nbsp;</p>
<p><strong>Behavior Analysis</strong><br /> <a href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/HUBBARD_BLOG_APRIL_2013_3.002.png"><img class="alignleft size-medium wp-image-4263" alt="HUBBARD_BLOG_APRIL_2013_3.002" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/HUBBARD_BLOG_APRIL_2013_3.002-300x225.png" width="300" height="225" /></a></p>
<p dir="ltr">Years later the Internet was invented and malicious, self-replicating code (worms) started to be more pervasive. The volume of malware increased dramatically and the ability to push updates was available over the Internet. In addition, the research community started using  behavior systems. This allowed researchers the luxury of not having to reverse engineer and analyze every sample they collected and automation was put into place to update their protection much more frequently. This overlap created the Security Venn of a small part science and part art.</p>
<p>&nbsp;</p>
<p><strong>Automation : Automation : Automation</strong><br /> <a href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/HUBBARD_BLOG_APRIL_2013_3.003.png"><img class="alignright size-medium wp-image-4265" alt="HUBBARD_BLOG_APRIL_2013_3.003" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/HUBBARD_BLOG_APRIL_2013_3.003-300x225.png" width="300" height="225" /></a></p>
<p dir="ltr">Next came the rise of the cyber-criminal. This made a dramatic impact on the volume and sophistication of attacks. At the same time researchers were building bigger automation systems, creating more collection mechanisms, and building large clusters of systems to automate analysis. Although this balanced the scale of science and art there was still a lot of tuning, development, and manual classification that was being created and maintained based on short-lived problems and driven, primarily from attack samples.</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p><strong>Big Data meets Security</strong></p>
<p><a href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/HUBBARD_BLOG_APRIL_2013_3.004.png"><img class="alignleft size-medium wp-image-4267" alt="HUBBARD_BLOG_APRIL_2013_3.004" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/HUBBARD_BLOG_APRIL_2013_3.004-300x225.png" width="300" height="225" /></a></p>
<p dir="ltr">As threats continue to evolve in sophistication and increase in numbers, the reliance on attack samples has caused continued decrease in efficacy. That said, there is a lot of room for continued innovation. We are using technologies from the big data/data mining movement in combination with machine learning and other scientific classification methods based off of DNS data, traffic, and hundreds of features we collect in real-time. This allows for predictive classification with very little human involvement,  post-classification push, and tips the scale towards more science and less art. While reverse-engineering and manual analysis has a role in advanced malware and forensics, it certainly is not equal to <a href="http://info.umbrella.com/Harnessing-Big-Data-for-Advanced-Threat-Protection-Whitepaper.html?utm_source=blog&amp;utm_medium=social&amp;utm_campaign=secven">scientific research</a>.</p>
<p>&nbsp;</p>
<p><strong>Design will increase in importance</strong></p>
<p><a href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/HUBBARD_BLOG_APRIL_2013_3.005.png"><img class="alignright size-medium wp-image-4269" alt="HUBBARD_BLOG_APRIL_2013_3.005" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/HUBBARD_BLOG_APRIL_2013_3.005-300x225.png" width="300" height="225" /></a></p>
<p dir="ltr">Just like in other technology disciplines, information security can greatly benefit from the data visualization movement in combination with big data. The security version of data viz is sometimes referred to as “Secure-viz” or “sec-viz”. This brings back the art piece of security as the human element may be needed to analyze the data through visualization. The human brain can process graphics in a much more complex way than machines today.</p>
<p><b><b> </b></b></p>
<p dir="ltr">Although it may be dependent on the class of attacks you are researching, it’s clear that the research community is moving towards a balance of science, art, and design as the evolution of the security venn. To learn more about how we do what we do, <a href="http://info.umbrella.com/infographic-using-big-data-for-malware-protection.html?utm_source=blog&amp;utm_medium=social&amp;utm_campaign=secven">take a look at our Infographic</a> on how the Umbrella Security Labs leverages Big Data for predictive threat research.</p>
<p>&nbsp;</p>
<p>The post <a href="http://labs.umbrella.com/2013/04/22/evolution-of-the-security-venn/">Evolution of the Security Venn</a> appeared first on <a href="http://labs.umbrella.com">Umbrella Security Labs</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://labs.umbrella.com/2013/04/22/evolution-of-the-security-venn/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Meet Jared, a member of the Umbrella Labs Security Community</title>
		<link>http://labs.umbrella.com/2013/04/19/meet-jared-a-member-of-the-umbrella-labs-security-community/</link>
		<comments>http://labs.umbrella.com/2013/04/19/meet-jared-a-member-of-the-umbrella-labs-security-community/#comments</comments>
		<pubDate>Fri, 19 Apr 2013 22:27:47 +0000</pubDate>
		<dc:creator>Vinny</dc:creator>
				<category><![CDATA[Security Community]]></category>
		<category><![CDATA[Security Research]]></category>

		<guid isPermaLink="false">/?p=4217</guid>
		<description><![CDATA[<p>The Umbrella Security Labs research team is searching for smart, passionate Internet security experts to join our community. Why join the community? In the most recent Umbrella Security Labs webcast we exposed how our cutting-edge research team is using advanced Big Data mining tactics to classify and categorize websites. The Labs community members are on [...]</p><p>The post <a href="http://labs.umbrella.com/2013/04/19/meet-jared-a-member-of-the-umbrella-labs-security-community/">Meet Jared, a member of the Umbrella Labs Security Community</a> appeared first on <a href="http://labs.umbrella.com">Umbrella Security Labs</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>The Umbrella Security Labs research team is searching for smart, passionate Internet security experts to join our community. Why join the community? In the most recent <a href="http://www.youtube.com/watch?v=wd58EISojh8">Umbrella Security Labs webcast</a> we exposed how our cutting-edge research team is using advanced Big Data mining tactics to classify and categorize websites. The Labs community members are on the front lines of predictive threat research, as they’re able to submit malicious and potentially malicious domains for community review and discussion. The community is made up of security researchers, IT technicians, and software engineers from all around the globe with strong dedication for keeping the Internet safe and secure.  For more information about how the whole process works, <a href="http://blog.opendns.com/2011/05/13/call-for-moderator-applications/">click here</a>.</p>
<p>To underline how valuable Umbrella Labs Security community members are, I want to introduce you to one of the community’s top contributors.  Meet Jared!</p>
<p><a href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/self_pic.jpg"><img class=" wp-image-4297 alignleft" alt="jaredperry" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/self_pic.jpg" width="192" height="266" /></a></p>
<p><b>OpenDNS: Tell us a bit about yourself.</b></p>
<p>Jared: My name is Jared Perry and I live in St. John&#8217;s on the island of Newfoundland, Canada. I mainly work in the areas of application and desktop security. I enjoy travelling and speaking about IT security topics. I also love to play the occasional real-time strategy such as Starcraft 2. </p>
<p><b>OpenDNS: What initially sparked your interest in internet security?</b></p>
<p>Jared: I have had a lot of teachers in my IT career and learning from them how to build systems initially sparked my interest in Internet security. Seeing how systems can be broken and learning from that drives my interest in IT security to grow.</p>
<p><b>OpenDNS: Have you ever been a victim of a scam or malware?</b></p>
<p>Jared: Early on I had my personal Web server compromised. It was a learning experience, and showed the necessity for multiple layers of defense when building any system. </p>
<p><b>OpenDNS: What&#8217;s the  most convincing scam that you&#8217;ve ever seen?</b></p>
<p>Jared: The AV industry :p, just kidding. I see so many scams in the run of the day and none are really that convincing if you follow some basic rules. Many are not coherent, have terrible grammar or are completely out of place. I think that the more subtle scams like Twitter/Facebook/Mobile apps that are &#8220;grey&#8221; at best and siphon off your personal information are probably the most convincing, as even the most aware users often don&#8217;t think twice about these types of apps.</p>
<p><b>OpenDNS: Besides OpenDNS, what are your favorite tools of the trade to use?</b></p>
<p>Jared: My absolute favorite tool is <a title="Burp Suite from Portswigger" href="http://www.portswigger.net/burp/" target="_blank">Burp Suite from Portswigger</a>, its like the Sriracha of security tools &#8211; you can use it on everything. I use it for application security testing and also a proxy for malware communications. <a title="Thug" href="https://github.com/buffer/thug" target="_blank">Thug</a> is also a pretty cool tool that I have been using lately as it saves me a lot of time.</p>
<p><b>OpenDNS: Do u currently have any security certificates like CISSP or GIAC from SANS.</b></p>
<p>Jared: I currently have two GIAC certifications, GIAC security essentials certification (GSEC) and GIAC Web application pen tester certification (GWAPT). <a href="http://www.giac.org/certified-professional/jared-perry/126579">http://www.giac.org/certified-professional/jared-perry/126579</a></p>
<p><b>OpenDNS: What do you most enjoy about being a member of our security community ?</b></p>
<p>Jared: I enjoy contributing to malware tagging at OpenDNS because it gives data back to the community and helps end users. It&#8217;s also a reason for me to develop scripts and learn new tools to tackle malware.</p>
<p><b>OpenDNS: Can you demonstrate or explain an interesting case that you moderated?</b></p>
<p>Jared: Ransomware these days I find interesting, I can only imagine an end user dealing with being locked out of there computer by this malware and some now even have webcam integration to scare the user even further. Its concerning when you find semi-legit services processing payments for these criminal groups. Its more rewarding when you send the data to services like OpenDNS to block these malware networks and report them to payment processors who hopefully freeze the accounts.</p>
<p><b>OpenDNS: What&#8217;s for lunch?</b></p>
<p>Jared: Fish and brewis! <a href="https://en.wikipedia.org/wiki/Fish_and_brewis">https://en.wikipedia.org/wiki/Fish_and_brewis</a></p>
<p>If you’re interested in joining Jared and the Umbrella Labs Security Community, just <a href="http://community.opendns.com/domaintagging/malware_application.php">apply online</a> and let us know why you’d make a great contributor to the community.</p>
<p>The post <a href="http://labs.umbrella.com/2013/04/19/meet-jared-a-member-of-the-umbrella-labs-security-community/">Meet Jared, a member of the Umbrella Labs Security Community</a> appeared first on <a href="http://labs.umbrella.com">Umbrella Security Labs</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://labs.umbrella.com/2013/04/19/meet-jared-a-member-of-the-umbrella-labs-security-community/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>On the trail of malicious dynamic DNS domains</title>
		<link>http://labs.umbrella.com/2013/04/15/on-the-trail-of-malicious-dynamic-dns-domains/</link>
		<comments>http://labs.umbrella.com/2013/04/15/on-the-trail-of-malicious-dynamic-dns-domains/#comments</comments>
		<pubDate>Mon, 15 Apr 2013 13:47:02 +0000</pubDate>
		<dc:creator>Dhia</dc:creator>
				<category><![CDATA[Security Research]]></category>
		<category><![CDATA[threat discovery]]></category>

		<guid isPermaLink="false">/?p=3819</guid>
		<description><![CDATA[<p>Dynamic DNS is a useful technology that allows a domain name to point to Internet resources hosted on changing public IP addresses. Consider an individual or small business with a dynamic IP who needs to provide consistent content or services publicly advertised to the outside world via a domain name (e.g. website, FTP server, mail [...]</p><p>The post <a href="http://labs.umbrella.com/2013/04/15/on-the-trail-of-malicious-dynamic-dns-domains/">On the trail of malicious dynamic DNS domains</a> appeared first on <a href="http://labs.umbrella.com">Umbrella Security Labs</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>Dynamic DNS is a useful technology that allows a domain name to point to Internet resources hosted on changing public IP addresses. Consider an individual or small business with a dynamic IP who needs to provide consistent content or services publicly advertised to the outside world via a domain name (e.g. website, FTP server, mail server, game room, webcam monitoring, etc). That&#8217;s where dynamic DNS helps out. Typically, these customers use the IP assigned to them by their ISP, and every time their IP changes, they notify their dynamic DNS provider to update its name servers so that the customer’s domain points now to the new IP. The notification happens through a client software installed on the customer’s router/computer or via an HTTP restful API. One such client software is <a href="http://dnsomatic.com/">DNSOMATIC</a> by OpenDNS.</p>
<p dir="ltr" style="text-align: center;"><a href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/dnsomatic_logo_2000.gif"><img class=" wp-image-3985 aligncenter" alt="dnsomatic_logo_2000" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/dnsomatic_logo_2000.gif" width="181" height="59" /></a></p>
<p dir="ltr">Unfortunately, the convenience of dynamic DNS did not go unnoticed by miscreants, who have been abusing free, dynamic DNS to perform various attacks such as <a href="http://threatpost.com/en_us/blogs/report-malvertising-campaign-thrives-dynamic-dns-021113">large-scale malvertising</a>, and targeted <a href="http://threatpost.com/en_us/blogs/spear-phishing-campaigns-use-fake-mandiant-apt1-report-lure-022113">spear-phishing</a>, which both resulted in drive-by downloads, and use it for <a href="http://blog.threatstop.com/tag/dynamic-dns/">botnet C&amp;C</a>. For attackers, using dynamic DNS constitutes another agile evasion technique against IP blacklisting. It also allows them to deliver malicious payloads from constantly-changing hosting IPs, be it infected individuals’ computers or compromised public websites. To circumvent domain blacklisting, attackers can also use randomly-generated disposable subdomains under the dynamic DNS domain to point to the next hop in a redirection chain or to the final malware hosting IP. This seems similar to fast flux, although from a definition standpoint they are different. For dynamic DNS, the dynamic IP is supposed to fall in the IP range of the ISP (1 or a few ASNs), whereas, with fast flux, a domain will be pointing to an increasing number of different IPs scattered across numerous ASNs and multiple geographical locations. Additionally, for dynamic DNS, the authoritative name servers for a dynamic DNS domain physically belong to the dynamic DNS provider, whereas with fast flux, double fluxing is possible where the name servers can be made point to constantly changing IPs of physical hosts located in disparate ASNs and countries.  In practice, dynamic DNS domains map to a much smaller set of IPs than fast flux.</p>
<p dir="ltr">In this blog, we discuss the relationship between dynamic DNS domains and malware as we see it through mining our large DNS data sets. This can also give some perspective on how to address the problem of rogue dynamic DNS domains.</p>
<p dir="ltr"><strong>Dynamic DNS analysis</strong></p>
<p dir="ltr">There are plenty of dynamic DNS providers, both free and for a cost. One good list of them is available <a href="http://dnslookup.me/dynamic-dns/">here</a>. </p>
<p dir="ltr">Dynamic DNS providers offer users to either register domains (2LDs), or subdomains (3LDs) under a predefined set of domains (2LDs). For instance, changeIP.com has a list of 155 domains, under which a user can freely register any subdomain of his choice (if it is available). For example, they have 1dumb.com and 2waky.com as pre-registered domains, and a user can register the hostnames johndoe.1dumb.com or myhomebusiness.2waky.com. changeIP also offers to users to register a domain under the following TLDs .com, .net, .info, .org, .biz, or .us. This latter choice requires an annual registration fee though. Similar offers are available from other providers like no-ip.com, afraid.org, Dyn.com (formerly known as DynDNS), etc. The common practice for attackers is to abuse the free subdomains. </p>
<p dir="ltr">For this study, we are interested in evaluating the amount of dynamic DNS domains we see in our daily authoritative DNS traffic and the percentage of malicious domains within, and also find out which subdomains are the most frequently abused. </p>
<p dir="ltr">First, we collect a sample of known malicious dynamic DNS domains, then, we compile a list of known pre-registered domains offered by a few dynamic DNS providers. For the malicious sample, the dynamic DNS providers that are mostly used are sitelutions.com, noip.com, changeip.com, and dnsdynamic.org. For the general list, we select known dynamic DNS providers such as: changeip.com, dnsdynamic.org, noip.com, freedns.afraid.org, dyndns.com, sitelutions.com, and 3322.org. These samples are not exhaustive as there are a lot more dynamic DNS providers (and more of them are abused). Some dynamic DNS providers are not limited to offering dynamic DNS services and act also as regular domain registrars, so a domain registered with a dynamic DNS provider and using its name servers might not necessarily be using the dynamic DNS service. We think, however, that these samples are representative enough for the sake of the analysis. </p>
<p dir="ltr">Next, we resolve the NS (name servers) of all domains in both samples. This list of name servers will be used to filter out the daily logs to identify domains using dynamic DNS. The logic here is that if we already know about a set of dynamic DNS domains, we can identify their name servers, and any new domain that uses these latter name servers will be assumed to be a dynamic DNS domain. The name servers from the general list give a trend on the percentage of total dynamic DNS domains in daily traffic, whereas, the name servers from the malicious sample provide an idea on the dynamic DNS traffic most likely to be malicious. The name servers associated with the sample of malicious dynamic DNS domains are: ns[1-3].changeip.org, ns[1-5].changeip.com, ns[1,2].dnsdynamic.org, nf[1-5].no-ip.com, and ns[1-5].sitelutions.com.</p>
<p dir="ltr">In the next step, we collect sample authoritative DNS logs from three resolvers in London, Ashburn and Singapore, where we have for every domain, its associated authoritative name server(s). For each day, we collect a sample of  1,518,782 domains on average with their name servers data. We collect logs for a week, then for each day, we identify those domains whose name servers fall within the list of name servers of dynamic DNS providers.</p>
<p dir="ltr">Finally, we compare the identified dynamic DNS domains against our blacklist (which is constantly updated with new data), and we show the results in the figures below. For the sake of this discussion, we call sortecielo.2waky.com a hostname, or subdomain or 3LD and 2waky.com a domain or 2LD. We can see in the figures, that there are 30,000+ dynamic DNS hostnames (3LDs) observed daily in the sample authoritative DNS traffic, and 3000+  corresponding domains (2LDs). For the same period, out of the same daily domain sets, we identify 1400+ malicious hostnames, and 200+ associated domains every day. This gives an idea about the density of the associations between a domain and its &#8220;children&#8221; subdomains.</p>
<p dir="ltr" style="text-align: center;"><a href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/dyn-dns-3lds-2lds-week.png"><img class="wp-image-3869 aligncenter" alt="dyn-dns-3lds-2lds-week" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/dyn-dns-3lds-2lds-week.png" width="451" height="294" /></a></p>
<p dir="ltr" style="text-align: center;"><a href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/dyn-dns-3lds-2lds-mal-week.png"><img class="wp-image-3877 aligncenter" alt="dyn-dns-3lds-2lds-mal-week" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/dyn-dns-3lds-2lds-mal-week.png" width="451" height="294" /></a></p>
<p dir="ltr"><strong>Top abused dynamic DNS domains</strong></p>
<p dir="ltr">In the following tables, we show the top 20 domains observed in daily traffic over a week as well as the top 20 domains used for malicious purposes over the same period. The counts next to the domain represent the number of hostnames under that domain. For example, on the first day, disqus had 18,294 hostnames of the form subdomain.disqus.com</p>
<p dir="ltr" style="text-align: center;"><a href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/top-20-dynDNS-2LDs-week.png"><img class="wp-image-3881 aligncenter" alt="top-20-dynDNS-2LDs-week" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/top-20-dynDNS-2LDs-week.png" width="587" height="183" /></a></p>
<p dir="ltr" style="text-align: center;"><a href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/top-20-dynDNS-mal-2LDs-week.png"><img class="wp-image-3891 aligncenter" alt="top-20-dynDNS-mal-2LDs-week" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/top-20-dynDNS-mal-2LDs-week.png" width="586" height="184" /></a></p>
<p dir="ltr">In the next table, we show side by side, for a single day the top 20 dynamic DNS domains in general traffic and those that had malicious hostnames. We indicate in red, those domains that are present in both top malicious domains and top popular domains in a daily DNS traffic i.e. no-ip.org, no-ip.biz, no-ip.info, hopto.org, dlinkddns.com, myftp.org, myvnc.com, myftp.biz, and us.to. What is noteworthy is that some popular dynamic DNS domains for general legitimate uses are also the top ones abused for malicious purposes. This makes blocking the entire domain a little tricky as that would deny visibility to a lot of legitimate content. Notice that the dynamic DNS provider no-ip.com is the most used one for both legitimate and malicious intent. The domains no-ip.org, no-ip.biz, no-ip.info, hopto.org, myftp.org, myvnc.com, and myftp.biz all use no-ip name servers. The right hand table for top malicious domains is illustrated at the end of this blog as a graph representation.</p>
<p dir="ltr" style="text-align: center;"><a href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/top-20-all-traffic-top-20-mal-same-day.png"><img class="wp-image-3901 aligncenter" alt="top-20-all-traffic-top-20-mal-same-day" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/top-20-all-traffic-top-20-mal-same-day.png" width="243" height="267" /></a></p>
<p>&nbsp;</p>
<p dir="ltr">[top 20 domains in general traffic on the left, and top 20 malicious domains on the right]</p>
<p>In the next table, we show the percentage of malicious usage of hostnames under each domain. For example, 56.71% of the 3LDs under hopto.org are malicious. Clearly, some domains are heavily used for malicious purposes.</p>
<p dir="ltr" style="text-align: center;"><a href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/percentage-mal-usage-top-20-dynDNS-2LDs.png"><img class=" wp-image-3913 aligncenter" alt="percentage-mal-usage-top-20-dynDNS-2LDs" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/percentage-mal-usage-top-20-dynDNS-2LDs.png" width="542" height="29" /></a></p>
<p dir="ltr">Below, we show an illustrative graph of the mapping of hostnames to domains taken from the list of detected malicious dynamic DNS domains of one day. The largest connected component on the top left corner is that of the domain hopto.org which has 245 malicious 3LDs associated with it, e.g. spilak.hopto.org, arasispodmoonf.hopto.org, 1n12.hopto.org, etc. On the right of hopto.org is the cluster of no-ip.org with 125 malicious 3LDs, then no-ip.info on the right with 103 hostnames, etc.</p>
<p dir="ltr" style="text-align: center;"><a href="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/3lds-to-2lds-mal-graph-1-day.png"><img class=" wp-image-3935 aligncenter" alt="3lds-to-2lds-mal-graph-1-day" src="https://d1l5aqlryxcb4m.cloudfront.net/wp-content/uploads/2013/04/3lds-to-2lds-mal-graph-1-day.png" width="511" height="541" /></a></p>
<p dir="ltr">We further took a sample of hostnames under hopto.org, and we determined that they were used to serve urls for Fragus Exploit kit, Best Pack Exploit kit, Incognito Exploit kit, Java and PDF exploits, leading to Trojan Fake AVs downloads. They were also used as CnC for W32/Dorkbot-EK, Rogue:Win32/Winwebsec, Trojan-Ransom.Win32.Mbro.ysw, IRC botnets, and also to serve phishing urls. In another sample, we observe that malicious dynamic DNS domains are massively associated with Blackhole exploits kit, Neosploit exploits, PDF exploits, and other exploits leading to the delivery of rogue antivirus, trojans, Backdoor SDBot, etc. It is worth mentioning that it is difficult to trace back the registration information of dynamic DNS domains that are in the form of subdomain.[predefined domain].tld because the whois information only records the registration information of the domain (the 2LD).</p>
<p dir="ltr">Note: The tools and platform I used for this study are our <a title="How the infrastructure behind the OpenDNS global network powers Umbrella reporting" href="/2013/03/19/the-opendns-log-analysis-infrasctructure/">Hadoop dev cluster</a>, <a title="Why we love Apache Pig" href="/2013/04/08/pig-jruby/">Apache Pig</a>, Python, and Unix shell tools (sed, awk, grep, etc).</p>
<p>The post <a href="http://labs.umbrella.com/2013/04/15/on-the-trail-of-malicious-dynamic-dns-domains/">On the trail of malicious dynamic DNS domains</a> appeared first on <a href="http://labs.umbrella.com">Umbrella Security Labs</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://labs.umbrella.com/2013/04/15/on-the-trail-of-malicious-dynamic-dns-domains/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Minified using memcached
Page Caching using memcached
Database Caching 5/12 queries in 0.006 seconds using memcached
Object Caching 1112/1151 objects using memcached
Content Delivery Network via Amazon Web Services: CloudFront: d1l5aqlryxcb4m.cloudfront.net

 Served from: labs.umbrella.com @ 2013-05-25 22:14:04 by W3 Total Cache -->