<?xml version="1.0" encoding="UTF-8" ?>
<?xml-stylesheet type="text/xsl" href="http://cs.pervasive.com/utility/FeedStylesheets/rss.xsl" media="screen"?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:slash="http://purl.org/rss/1.0/modules/slash/" xmlns:wfw="http://wellformedweb.org/CommentAPI/"><channel><title>Pervasive Community Site</title><link>http://cs.pervasive.com/blogs/</link><description /><dc:language>en-US</dc:language><generator>CommunityServer 2007 SP1 (Build: 20510.895)</generator><item><title>We have moved</title><link>http://cs.pervasive.com/blogs/datarush/archive/2011/11/15/we-have-moved.aspx</link><pubDate>Tue, 15 Nov 2011 15:58:00 GMT</pubDate><guid isPermaLink="false">3741b99c-ad24-4023-9eca-ddf558b8b674:49315</guid><dc:creator>ijgarcia</dc:creator><slash:comments>0</slash:comments><description>We have moved our blogs &lt;a href="http://www.pervasivedatarush.com/Blogs/BigDataBlogs"&gt;here&lt;/a&gt;.&lt;br /&gt;&lt;img src="http://cs.pervasive.com/aggbug.aspx?PostID=49315" width="1" height="1"&gt;</description></item><item><title>Making The Most Out Of Your Data: Big Data Opportunities</title><link>http://cs.pervasive.com/blogs/datarush/archive/2011/10/31/making-the-most-out-of-your-data-big-data-opportunities.aspx</link><pubDate>Mon, 31 Oct 2011 13:46:00 GMT</pubDate><guid isPermaLink="false">3741b99c-ad24-4023-9eca-ddf558b8b674:49191</guid><dc:creator>David_Inbar</dc:creator><slash:comments>0</slash:comments><description>&lt;p&gt;&lt;span style="font-size:11pt;"&gt;Last month, &lt;b&gt;Forrester Research&lt;/b&gt; released a report, &amp;quot;&lt;/span&gt;&lt;a href="http://www.forrester.com/rb/Research/expand_digital_horizon_with_big_data/q/id/60751/t/2" target="_blank"&gt;&lt;span style="font-size:11pt;"&gt;Expand Your Digital Horizon With Big Data&lt;/span&gt;&lt;/a&gt;&lt;span style="font-size:11pt;"&gt;,” for CIOs that focused on how they
should approach &lt;b&gt;big data&lt;/b&gt; in order to
take full advantage of it for their businesses. It addressed how big data is
influencing markets across industries and is prevalent in various business
sectors, such as healthcare, web marketing and telecommunications. The report
also discussed several factors for companies to consider when working with
their data, such as redefining approaches to using data beyond traditional BI
tools.&lt;/span&gt;&lt;span style="font-size:11pt;"&gt;&lt;span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-size:11pt;"&gt;&lt;/span&gt;&lt;/p&gt;
&lt;span style="font-size:11.0pt;mso-ascii-font-family:Calibri;mso-hansi-font-family:Calibri;mso-bidi-font-family:Calibri;"&gt;&lt;/span&gt;
&lt;p class="MsoNormal"&gt;&lt;span style="font-size:11pt;"&gt;Key points included
how major potential challenges of big data include not only the cost of the
technology but also the shortage of data scientists. Companies are starting to
seek professionals with big data skills, stimulated by new pressures to scale
large volumes of unstructured data. According to a &lt;/span&gt;&lt;a href="http://www.forbes.com/sites/danwoods/2011/10/11/emc-greenplums-steven-hillion-on-what-is-a-data-scientist/" target="_blank"&gt;&lt;span style="font-size:11pt;"&gt;recent Forbes article by Quentin Hardy&lt;/span&gt;&lt;/a&gt;&lt;span style="font-size:11pt;"&gt;, the “data scientist” phenomenon has
recently started to emerge in big data conversations with debates currently
over its definition, the skill set required and the data scientist’s role in
the big data trend. &lt;/span&gt;&lt;/p&gt;

&lt;p class="MsoNormal"&gt;&lt;span style="font-size:11pt;"&gt;Forrester analyst
Brian Hopkins, who co-authored the report with Boris Evelson, Sharyn Leaver,
Connie Moore, Alex Cullen, Mike Gilpin and Mackenzie Cahill, &lt;a href="http://blogs.forrester.com/brian_hopkins/11-09-30-big_data_will_help_shape_your_markets_next_big_winners" target="_blank"&gt;posted a brief summary of the report on his blog&lt;/a&gt;, &lt;/span&gt;&lt;span style="font-size:11pt;"&gt;emphasizing three key questions for companies to address in order to understand
and create a big data plan&lt;b&gt;: 1) What is
new about big data? 2) What is it? and 3) How will it influence our market?&lt;/b&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p class="MsoNormal"&gt;&lt;span style="font-size:11pt;"&gt;&lt;b&gt;&lt;/b&gt;&lt;/span&gt;&lt;span style="font-size:11pt;"&gt;&lt;br /&gt;Some interesting data points, findings and recommendations from the report
include:&lt;/span&gt;&lt;span style="font-size:11pt;font-family:&amp;#39;Times New Roman&amp;#39;,&amp;#39;serif&amp;#39;;"&gt;&lt;/span&gt;
&lt;/p&gt;

&lt;p class="MsoListParagraphCxSpFirst" style="text-indent:-0.25in;margin-left:40px;"&gt;&lt;span style="font-size:11.0pt;mso-ascii-font-family:Calibri;mso-fareast-font-family:Calibri;mso-hansi-font-family:Calibri;mso-bidi-font-family:Calibri;"&gt;&lt;span style="mso-list:Ignore;"&gt;1.&lt;span style="font:7.0pt &amp;#39;Times New Roman&amp;#39;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-size:11.0pt;mso-ascii-font-family:Calibri;mso-hansi-font-family:Calibri;mso-bidi-font-family:Calibri;"&gt;Forrester surveyed 60 of their clients
who are using or experimenting with big data computing. 75% of the surveyed
clients responded that data volume was the main reason for looking into big
data solutions. 58% of respondents in Forrester&amp;#39;s June 2011 Global Big Data
Online Survey reported interest in insight driven by an analytics approach.&lt;/span&gt;&lt;span style="font-size:11.0pt;font-family:&amp;#39;Times New Roman&amp;#39;,&amp;#39;serif&amp;#39;;"&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p class="MsoListParagraphCxSpMiddle" style="text-indent:-0.25in;margin-left:40px;"&gt;&lt;span style="font-size:11.0pt;mso-ascii-font-family:Calibri;mso-fareast-font-family:Calibri;mso-hansi-font-family:Calibri;mso-bidi-font-family:Calibri;"&gt;&lt;span style="mso-list:Ignore;"&gt;2.&lt;span style="font:7.0pt &amp;#39;Times New Roman&amp;#39;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-size:11.0pt;mso-ascii-font-family:Calibri;mso-hansi-font-family:Calibri;mso-bidi-font-family:Calibri;"&gt;70% of respondents expressed interest in
big data for managing current enterprise information. Therefore, many early
adopters are using big data solutions to understand existing information and
not new data sources.&lt;/span&gt;&lt;span style="font-size:11.0pt;font-family:&amp;#39;Times New Roman&amp;#39;,&amp;#39;serif&amp;#39;;"&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p class="MsoListParagraphCxSpMiddle" style="text-indent:-0.25in;margin-left:40px;"&gt;&lt;span style="font-size:11pt;"&gt;&lt;span&gt;3.&lt;span style="font:7pt &amp;#39;Times New Roman&amp;#39;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-size:11pt;"&gt;Cost is the underlying theme for big data
challenges. These challenges include:&lt;/span&gt;&lt;/p&gt;

&lt;p class="MsoListParagraphCxSpMiddle" style="text-indent:-0.25in;margin-left:80px;"&gt;&lt;span style="font-size:11pt;font-family:Symbol;"&gt;&lt;span&gt;·&lt;span style="font:7pt &amp;#39;Times New Roman&amp;#39;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;
&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-size:11pt;"&gt;&lt;span style="font-weight:bold;"&gt;Volume&lt;/span&gt;, in terms of data amount exceeding how it can be stored cost-effectively&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p class="MsoListParagraphCxSpMiddle" style="text-indent:-0.25in;margin-left:80px;"&gt;&lt;span style="font-size:11pt;font-family:Symbol;"&gt;&lt;span&gt;·&lt;span style="font:7pt &amp;#39;Times New Roman&amp;#39;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;
&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-size:11pt;"&gt;&lt;span style="font-weight:bold;"&gt;Velocity&lt;/span&gt;, in terms of processing data fast enough for businesses to respond and adapt rapidly&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p class="MsoListParagraphCxSpMiddle" style="text-indent:-0.25in;margin-left:80px;"&gt;&lt;span style="font-size:11pt;font-family:Symbol;"&gt;&lt;span&gt;·&lt;span style="font:7pt &amp;#39;Times New Roman&amp;#39;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;
&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-size:11pt;"&gt;&lt;span style="font-weight:bold;"&gt;Variety&lt;/span&gt;, in terms of integration costs of adding new data feeds and interpreting variable data structures&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p class="MsoListParagraphCxSpMiddle" style="text-indent:-0.25in;margin-left:80px;"&gt;&lt;span style="font-size:11.0pt;font-family:Symbol;mso-fareast-font-family:Symbol;mso-bidi-font-family:Symbol;"&gt;&lt;span style="mso-list:Ignore;"&gt;·&lt;span style="font:7.0pt &amp;#39;Times New Roman&amp;#39;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;
&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-size:11.0pt;mso-ascii-font-family:Calibri;mso-hansi-font-family:Calibri;mso-bidi-font-family:Calibri;"&gt;&lt;span style="font-weight:bold;"&gt;Variability&lt;/span&gt;, in terms of complex and highly variable data structures that complicate analysis&lt;/span&gt; &lt;br /&gt;&lt;/p&gt;

&lt;p class="MsoListParagraphCxSpMiddle" style="text-indent:-0.25in;margin-left:40px;"&gt;&lt;span style="font-size:11pt;"&gt;&lt;span&gt;4.&lt;span style="font:7pt &amp;#39;Times New Roman&amp;#39;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-size:11pt;"&gt;Big data is still in its early stage, and
thus serves as a challenge to businesses. To succeed and overcome the
challenges, businesses should:&lt;/span&gt;&lt;/p&gt;

&lt;p class="MsoListParagraphCxSpFirst" style="margin-left:1.0in;mso-add-space:auto;text-indent:-.25in;mso-pagination:none;mso-list:l2 level1 lfo3;tab-stops:11.0pt .5in;mso-layout-grid-align:none;text-autospace:none;"&gt;&lt;span style="font-size:11.0pt;font-family:Symbol;mso-fareast-font-family:Symbol;mso-bidi-font-family:Symbol;"&gt;&lt;span style="mso-list:Ignore;"&gt;·&lt;span style="font:7.0pt &amp;#39;Times New Roman&amp;#39;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;
&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-size:11.0pt;mso-ascii-font-family:Calibri;mso-hansi-font-family:Calibri;mso-bidi-font-family:Calibri;"&gt;Encourage
collaboration between business and IT&lt;/span&gt;&lt;span style="font-size:11.0pt;font-family:&amp;#39;Times New Roman&amp;#39;,&amp;#39;serif&amp;#39;;"&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p class="MsoListParagraphCxSpMiddle" style="margin-left:1.0in;mso-add-space:auto;text-indent:-.25in;mso-pagination:none;mso-list:l0 level1 lfo2;tab-stops:11.0pt .5in;mso-layout-grid-align:none;text-autospace:none;"&gt;&lt;span style="font-size:11.0pt;font-family:Symbol;mso-fareast-font-family:Symbol;mso-bidi-font-family:Symbol;"&gt;&lt;span style="mso-list:Ignore;"&gt;·&lt;span style="font:7.0pt &amp;#39;Times New Roman&amp;#39;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;
&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-size:11.0pt;mso-ascii-font-family:Calibri;mso-hansi-font-family:Calibri;mso-bidi-font-family:Calibri;"&gt;Create new,
agile and compliant processes and approaches to deliver big data solutions&lt;/span&gt;&lt;span style="font-size:11.0pt;font-family:&amp;#39;Times New Roman&amp;#39;,&amp;#39;serif&amp;#39;;"&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p class="MsoListParagraphCxSpLast" style="margin-left:1.0in;mso-add-space:auto;text-indent:-.25in;mso-pagination:none;mso-list:l1 level1 lfo1;tab-stops:11.0pt .5in;mso-layout-grid-align:none;text-autospace:none;"&gt;&lt;span style="font-size:11.0pt;font-family:Symbol;mso-fareast-font-family:Symbol;mso-bidi-font-family:Symbol;"&gt;&lt;span style="mso-list:Ignore;"&gt;·&lt;span style="font:7.0pt &amp;#39;Times New Roman&amp;#39;;"&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;
&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-size:11.0pt;mso-ascii-font-family:Calibri;mso-hansi-font-family:Calibri;mso-bidi-font-family:Calibri;"&gt;Adapt
quickly to fast-paced, changing technology trends for rapid growth&amp;nbsp;&lt;/span&gt;&lt;span style="font-size:11.0pt;font-family:&amp;#39;Times New Roman&amp;#39;,&amp;#39;serif&amp;#39;;"&gt;&lt;/span&gt;&lt;/p&gt;


&lt;p class="MsoNormal" style="mso-pagination:none;tab-stops:11.0pt .5in;mso-layout-grid-align:none;text-autospace:none;"&gt;&lt;span style="font-size:11.0pt;mso-ascii-font-family:Calibri;mso-hansi-font-family:Calibri;mso-bidi-font-family:&amp;#39;Times New Roman&amp;#39;;"&gt;As
the big data trend continues to flourish throughout industries, companies that
tend to have a handle on managing and using their data are able to develop
forward-looking strategies and gain a competitive advantage over their
competitors. CIOs are facing increased pressure to figure out ways to make use
of the company’s data to achieve meaningful insight for valuable business
decisions. As a result, this brings up an important question to address: how is
the CIO’s role evolving as new big data technologies emerge and IT spending in
big data increases? Is your business feeling the pressure to join the
competitive big data crowd?&lt;/span&gt;&lt;/p&gt;

&lt;p class="MsoNormal" style="mso-pagination:none;tab-stops:11.0pt .5in;mso-layout-grid-align:none;text-autospace:none;"&gt;&lt;span style="font-size:11.0pt;mso-ascii-font-family:Calibri;mso-hansi-font-family:Calibri;mso-bidi-font-family:&amp;#39;Times New Roman&amp;#39;;"&gt;&lt;br /&gt;Pervasive
Big Data encompasses &lt;/span&gt;&lt;a href="http://www.pervasivedatarush.com/" target="_blank"&gt;&lt;span style="font-size:11.0pt;mso-ascii-font-family:Calibri;mso-hansi-font-family:Calibri;mso-bidi-font-family:&amp;#39;Times New Roman&amp;#39;;"&gt;Pervasive’s DataRush big data
software platform&lt;/span&gt;&lt;/a&gt;&lt;span style="font-size:11.0pt;mso-ascii-font-family:Calibri;mso-hansi-font-family:Calibri;mso-bidi-font-family:&amp;#39;Times New Roman&amp;#39;;"&gt;
for companies to consume high volume and variable data for complex
analysis.&lt;span style="mso-spacerun:yes;"&gt;&amp;nbsp; &lt;/span&gt;We have been watching this big
data trend grow for the past two years and have built our tools to help with
the challenges of processing and analyzing big data.&lt;span style="mso-spacerun:yes;"&gt; &lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;&lt;img src="http://cs.pervasive.com/aggbug.aspx?PostID=49191" width="1" height="1"&gt;</description><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/big+data/default.aspx">big data</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/trend/default.aspx">trend</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/challenges/default.aspx">challenges</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Forrester+research/default.aspx">Forrester research</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/unstructured+data/default.aspx">unstructured data</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/data+scientist/default.aspx">data scientist</category></item><item><title>Is Your Big Data Problem Solved Yet?</title><link>http://cs.pervasive.com/blogs/datarush/archive/2011/10/20/is-your-big-data-problem-solved-yet.aspx</link><pubDate>Thu, 20 Oct 2011 15:21:00 GMT</pubDate><guid isPermaLink="false">3741b99c-ad24-4023-9eca-ddf558b8b674:49081</guid><dc:creator>livey</dc:creator><slash:comments>0</slash:comments><description>
&lt;p&gt;We’re certainly more informed these days.  And even my grandmother is hearing of the challenges of big data.  But after listening to DM Radio last week, &lt;a href="http://www.information-management.com/dmradio/-10019295-1.html" target="_blank"&gt;Avoiding Bottlenecks and Hurdles in Data Delivery&lt;/a&gt;, the big data crisis appears to be subsiding according to &lt;a href="http://twitter.com/#%21/prussom" target="_blank"&gt;Philip Russom&lt;/a&gt;.  Or is it?
&lt;/p&gt;
&lt;p&gt;Russom points out that the biggest bottleneck in big data is moving the data from processing into data flow.  Old ways of processing data relied on the hardware.  The faster the hardware then the faster the processing…right? &lt;a href="http://twitter.com/#%21/Pervasive" target="_blank"&gt;   Pervasive&lt;/a&gt;’s Big Data Director &lt;a href="http://twitter.com/#%21/davidinbar" target="_blank"&gt;David Inbar&lt;/a&gt; pointed out that old software IS the bottleneck.  Organizations are throwing more hardware to handle slow processes when it’s poorly written software causing the problem.  
&lt;/p&gt;
&lt;p&gt;So we have organizations that have overhauled their technology to stay ahead of the curve, but they haven’t updated their software to process in parallel with their new technology.  Do they still sell single-core work machines?  If it’s more than a single core machine then your software needs to process in parallel for the most high performance efficiency.  Parallel processing is the ability to carry out multiple operations simultaneously.  And parallel processing allows organizations to expand and handle bottlenecks of data traffic.  Once you’ve parallelized your working environment, is your big data problem solved?  Probably not.  
&lt;/p&gt;
&lt;p&gt;I’m not sure if the big data crisis is subsiding OR if big data awareness is more prevalent.  Organizations are certainly starting to experience the IT side of big data, but what about educating the persons running the analytics? When Jeff Kelly wrote &lt;a href="http://siliconangle.com/blog/2011/09/26/data-scientists-are-rocking-the-big-data-world/" target="_blank"&gt;&lt;i&gt;Data Scientists Are Rocking the Big Data World&lt;/i&gt;&lt;/a&gt;, he mentions that there are few formal training and educational programs that focus on big data and analytics.  To get the right answer, you have to ask the right question.  And to ask the right question, you have to understand the quality of your data.  
&lt;/p&gt;
&lt;p&gt;Hopefully parents in the high tech industry are coaxing their children and their educational system to jump on the big data train today because I have a feeling that it’s going to be a long ride.    
&lt;/p&gt;
&lt;img src="http://cs.pervasive.com/aggbug.aspx?PostID=49081" width="1" height="1"&gt;</description><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Multicore/default.aspx">Multicore</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/big+data/default.aspx">big data</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Analytics/default.aspx">Analytics</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Bottlenecks/default.aspx">Bottlenecks</category></item><item><title>Webinar: Big Data and Hadoop with guest speaker Jim Kobielus</title><link>http://cs.pervasive.com/blogs/datarush/archive/2011/08/31/webinar-big-data-and-hadoop-with-guest-speaker-jim-kobielus.aspx</link><pubDate>Wed, 31 Aug 2011 20:27:00 GMT</pubDate><guid isPermaLink="false">3741b99c-ad24-4023-9eca-ddf558b8b674:48575</guid><dc:creator>aschmidt</dc:creator><slash:comments>0</slash:comments><description>&lt;p&gt;&lt;span class="Apple-style-span" style="font-family:Georgia,&amp;#39;Times New Roman&amp;#39;,&amp;#39;Bitstream Charter&amp;#39;,Times,serif;font-size:13px;font-style:normal;font-variant:normal;font-weight:normal;letter-spacing:normal;line-height:19px;orphans:2;text-indent:0px;text-transform:none;white-space:normal;widows:2;word-spacing:0px;"&gt;&lt;p&gt;To help&amp;nbsp;enterprises learn more about big data and how it fits with their traditional data warehouse and data mart,&lt;span class="Apple-converted-space"&gt;&amp;nbsp;&lt;/span&gt;&lt;a href="http://www.pervasivedatarush.com/" target="_blank"&gt;Pervasive DataRush&lt;/a&gt;&lt;span class="Apple-converted-space"&gt;&amp;nbsp;&lt;/span&gt;and&lt;span class="Apple-converted-space"&gt;&amp;nbsp;&lt;/span&gt;&lt;a href="http://www.karmasphere.com/" target="_blank"&gt;Karmasphere&lt;/a&gt;&amp;nbsp;are hosting a&lt;span class="Apple-converted-space"&gt;&amp;nbsp;&lt;/span&gt;&lt;a href="http://lp.pervasive.com/BigDataWebinarWithJamesKobielus.html" target="_blank"&gt;webinar&lt;/a&gt;&amp;nbsp;titled, &amp;quot;Big Data: The Role, Value and Best Practices of Hadoop,&amp;quot; taking place September 7&amp;nbsp;at 9:00 a.m. PDT/1:00 p.m. EDT (for the Americas), and September 8&amp;nbsp;at 3:00 p.m. BST (for Europe).&amp;nbsp;During the webinar, guest speaker&amp;nbsp;&lt;a href="http://www.forrester.com/rb/analyst/james_kobielus" target="_blank"&gt;James G. Kobielus&lt;/a&gt;, senior analyst with Forrester&amp;nbsp;Research, Inc., will discuss the concept of big data, and the emerging software platform used for big data: Hadoop. Following the discussion, Pervasive DataRush and Karmasphere will each give a short overview of their big data offerings.&lt;/p&gt;&lt;p&gt;&lt;a href="http://www.dataintegrationblog.com/wp-content/uploads/2011/08/James-Kobielus.jpg"&gt;&lt;/a&gt;&lt;a href="http://cs.pervasive.com/blogs/datarush/James-Kobielus.jpg"&gt;&lt;img src="http://cs.pervasive.com/blogs/datarush/James-Kobielus.jpg" style="width:121px;height:182px;" border="0" alt="" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;Attendees can expect to learn:&lt;/p&gt;&lt;p&gt;- Where Hadoop adds the most business value.&lt;br /&gt;- Best practices for using Hadoop.&lt;br /&gt;- How companies are combining Hadoop with standard data warehouses and data marts.&lt;br /&gt;- The Forrester model of the Hadoop ecosystem.&lt;/p&gt;&lt;p&gt;It should be a lively discussion. If you would like to attend the webinar, please register here:&amp;nbsp;&lt;a href="http://lp.pervasive.com/BigDataWebinarWithJamesKobielus.html"&gt;http://lp.pervasive.com/BigDataWebinarWithJamesKobielus.html&lt;/a&gt;&lt;/p&gt;&lt;/span&gt;&lt;/p&gt;&lt;img src="http://cs.pervasive.com/aggbug.aspx?PostID=48575" width="1" height="1"&gt;</description><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/DataRush/default.aspx">DataRush</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Pervasive+DataRush/default.aspx">Pervasive DataRush</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Pervasive/default.aspx">Pervasive</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/big+data/default.aspx">big data</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Big++Data/default.aspx">Big  Data</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/James+Kobielus/default.aspx">James Kobielus</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Hadoo/default.aspx">Hadoo</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Karmasphere/default.aspx">Karmasphere</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Forrester/default.aspx">Forrester</category></item><item><title>BIG Thoughts on BIG Data </title><link>http://cs.pervasive.com/blogs/datarush/archive/2011/08/03/big-thoughts-on-big-data.aspx</link><pubDate>Wed, 03 Aug 2011 20:59:00 GMT</pubDate><guid isPermaLink="false">3741b99c-ad24-4023-9eca-ddf558b8b674:48364</guid><dc:creator>livey</dc:creator><slash:comments>1</slash:comments><description>&lt;p&gt;&lt;font size="3"&gt;&lt;font face="Calibri"&gt;We had the pleasure of meeting with David Linthicum last month to get his thoughts on Big Data.&lt;/font&gt;&lt;/font&gt;&lt;/p&gt;&lt;a title="Big Data Trends with David Linthicum" href="http://www.youtube.com/watch?v=_Nshx9LhGhw" target="_blank"&gt;&lt;/a&gt;
&lt;p&gt;&lt;a title="David Linthicum discusses big data" href="http://www.youtube.com/watch?v=_Nshx9LhGhw" target="_blank"&gt;&lt;img border="0" src="http://cs.pervasive.com/blogs/datarush/Linthicumvideo.png" alt="" /&gt;&lt;/a&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;&lt;font size="3" face="Calibri"&gt;Many organizations are facing the reality that they have more data coming in than they can process data.&lt;span style="mso-spacerun:yes;"&gt;&amp;nbsp; &lt;/span&gt;Apache Hadoop offers a big opportunity for businesses, but, &lt;/font&gt;&lt;a title="Big Data Trends with David Linthicum" href="http://www.youtube.com/watch?v=_Nshx9LhGhw" target="_blank"&gt;&lt;font size="3" face="Calibri"&gt;according to David Linthicum&lt;/font&gt;&lt;/a&gt;&lt;font size="3"&gt;&lt;font face="Calibri"&gt;, enterprises are in an experimental phase with Hadoop right now trying to learn how it works and how they can best use it.&lt;span style="mso-spacerun:yes;"&gt;&amp;nbsp; &lt;/span&gt;&lt;/font&gt;&lt;/font&gt;&lt;font size="3"&gt;&lt;font face="Calibri"&gt;&lt;/font&gt;&lt;/font&gt;&lt;/p&gt;
&lt;p&gt;&lt;font size="3"&gt;&lt;font face="Calibri"&gt;The open-source yellow elephant has spawned a large following.&lt;span style="mso-spacerun:yes;"&gt;&amp;nbsp; &lt;/span&gt;Yahoo! has recently launched a Hadoop-spinoff company called Hortonworks as a prime player in the Big Data game to launch enterprise-quality products.&lt;span style="mso-spacerun:yes;"&gt;&amp;nbsp; &lt;/span&gt;&lt;/font&gt;&lt;/font&gt;&lt;a title="Hortonworks OSCON Followup" href="http://www.hortonworks.com/takeaways-from-oscon-2011/" target="_blank"&gt;&lt;font size="3" face="Calibri"&gt;Hortonworks presented recently&lt;/font&gt;&lt;/a&gt;&lt;font size="3"&gt;&lt;font face="Calibri"&gt;&lt;span style="mso-spacerun:yes;"&gt;&amp;nbsp; &lt;/span&gt;at OSCON Data 2011 where they revealed their current goals on Hadoop security and framework scalability. &lt;/font&gt;&lt;/font&gt;&lt;/p&gt;&lt;font size="3"&gt;
&lt;p&gt;&lt;font face="Calibri"&gt;Software vendors are working on filling in the gaps in Hadoop to solve business problems for the enterprise.&lt;span style="mso-spacerun:yes;"&gt;&amp;nbsp; &lt;/span&gt;Cloudera is aiming to make the Hadoop experience pain-free with its fully integrated Hadoop distribution (CDH) and management suite; Karmasphere provides big data analytics solutions; Pervasive DataRush optimizes Hadoop jobs to increase performance on any platform.&lt;/font&gt;&lt;/p&gt;&lt;/font&gt;
&lt;p&gt;&lt;font size="3" face="Calibri"&gt;Where is Hadoop going?&lt;span style="mso-spacerun:yes;"&gt;&amp;nbsp; &lt;/span&gt;We are paving the path as we speak.&lt;span style="mso-spacerun:yes;"&gt;&amp;nbsp; &lt;/span&gt;Stay tuned to this blog site&amp;nbsp;for further details.&lt;span style="mso-spacerun:yes;"&gt;&amp;nbsp; &lt;/span&gt;And follow the latest industry news on Pervasive&amp;#39;s &lt;/font&gt;&lt;a href="http://www.pervasive.com/OnlineNewsDigest.aspx" target="_blank"&gt;&lt;font color="#0000ff" size="3" face="Calibri"&gt;Big Data Digest&lt;/font&gt;&lt;/a&gt;!&lt;font size="3"&gt;&lt;font face="Calibri"&gt;&lt;/font&gt;&lt;/font&gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;&lt;img src="http://cs.pervasive.com/aggbug.aspx?PostID=48364" width="1" height="1"&gt;</description><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/big+data/default.aspx">big data</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Hadoop/default.aspx">Hadoop</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/David+Linthicum/default.aspx">David Linthicum</category></item><item><title>Capture All Your Data, All The Time</title><link>http://cs.pervasive.com/blogs/datarush/archive/2011/07/19/capture-all-your-data-all-the-time.aspx</link><pubDate>Wed, 20 Jul 2011 02:44:00 GMT</pubDate><guid isPermaLink="false">3741b99c-ad24-4023-9eca-ddf558b8b674:48269</guid><dc:creator>livey</dc:creator><slash:comments>0</slash:comments><description>&lt;p&gt;We are in Washington D.C. this week exhibiting at the &lt;a href="http://www.fose.com/Events/FOSE-2011/Home.aspx" target="_blank"&gt;FOSE&lt;/a&gt; show and we&amp;#39;re showing this awesome demo that we&amp;#39;ve been working on.  Pervasive has taken &lt;a href="http://www.pervasivedatarush.com/" target="_blank"&gt;Pervasive DataRush&lt;/a&gt; to new levels in order to meet specific &lt;a href="http://www.dhs.gov/files/cybersecurity.shtm" target="_blank"&gt;cyber security&lt;/a&gt; challenges.  It’s no secret that we’ve entered the big data era. There are millions of devices generating data every second: log events, security events, network traffic, firewalls, and so much more (this various is shown as shapes in the diagram below).  And there’s lots of great software out there to look at these events, but only for a short time frame.  One of the most daunting challenges facing organizations today is capturing, archiving, and analyzing ALL this data at any given time.  There’s so much data that today’s software is failing to archive and analyze cyber security events as a WHOLE.&lt;/p&gt;
&lt;p&gt;Until now.&lt;/p&gt;
&lt;p&gt;Pervasive has developed a Historical Event Processing proof of concept (POC) that leverages and exposes the power of Pervasive DataRush as it captures and archives one million events per second into &lt;a href="http://hbase.apache.org/" target="_blank"&gt;Hadoop’s HBase&lt;/a&gt;.  Holy smokes, that’s amazing!  This consumption rate is orders of magnitude faster than any solution on the market today. &lt;/p&gt;
&lt;p&gt;For this POC, we used a single server box with 48 cores, 40 drives, and 258G of memory.   But the processing rate will increase with increased cores or multiple nodes.   We used Pervasive DataRush listeners for multiple log events and archivers to write to any database.  And we actually captured 1.6 million events in 62 seconds, to be exact. Once the millions of events are captured and archived, you can use Pervasive DataRush to launch any set of queries or apply data mining algorithms to perform deep analytics on the dataset as a whole to look for a change in patterns...particularly useful in cyber security.  For this POC, we used a  &lt;a href="http://hive.apache.org/" target="_blank"&gt;Hive&lt;/a&gt; query to count server process types that generated each event and calculate percentages.  Once the query was completed, Pervasive DataRush sent counts of each message to Google Charting to display the data visually.  The entire process helps meet the demands of today&amp;#39;s cyber security challenges, but we&amp;#39;re especially impressed about the speed that we&amp;#39;re able to capture this events and run queries.&lt;/p&gt;
&lt;p&gt;If we captured a million events in one minute, imagine how much data is created everyday.  Now organizations can capture all the data, archive it, and perform deep analytics as a whole. Stop by our booth #1528 at FOSE to see the POC in person.&lt;/p&gt;
&lt;img src="http://cs.pervasive.com/blogs/datarush/Historical-event-processing-demo.png" alt="" /&gt;&lt;img src="http://cs.pervasive.com/aggbug.aspx?PostID=48269" width="1" height="1"&gt;</description><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/big+data/default.aspx">big data</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/cyber+security/default.aspx">cyber security</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/multicore+processing/default.aspx">multicore processing</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/event+capturing/default.aspx">event capturing</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/historical+event+processing/default.aspx">historical event processing</category></item><item><title>Pervasive Not Only Presenting But Introducing Pervasive TurboRush for Hive at Hadoop Summit </title><link>http://cs.pervasive.com/blogs/datarush/archive/2011/06/22/pervasive-not-only-presenting-but-introducing-pervasive-turborush-for-hive-at-hadoop-summit.aspx</link><pubDate>Wed, 22 Jun 2011 14:46:00 GMT</pubDate><guid isPermaLink="false">3741b99c-ad24-4023-9eca-ddf558b8b674:48064</guid><dc:creator>Richard Maddox</dc:creator><slash:comments>0</slash:comments><description>&lt;p&gt;Pervasive Software’s Chief Integration Technologist Paul Dingman will be a presenter at &lt;a href="http://developer.yahoo.com/events/hadoopsummit2011/agenda.html"&gt;Hadoop Summit 2011&lt;/a&gt; on June 29, 2011, in Santa Clara, CA. Paul is participating in the Application and Research track and will present from 1:15 pm to 1:45 pm PDT. Paul&amp;#39;s presentation is &lt;a href="http://developer.yahoo.com/events/hadoopsummit2011/agenda.html#21"&gt;&amp;quot;Hadoop on a Personal Supercomputer.&amp;quot; &lt;/a&gt;&lt;/p&gt;
&lt;p&gt;He will discuss the potential advantages of running Hadoop on single many-core machines with large disk arrays, illustrating cases where Hadoop running on one, or a few, fat nodes can deliver faster results and be more cost effective than Hadoop running on a greater number of lower end machines. He will also discuss opportunities to exploit intra-server parallelism to improve task communication and coordination overhead.&lt;/p&gt;
&lt;p&gt;&lt;a href="http://developer.yahoo.com/events/hadoopsummit2011/agenda.html"&gt;The Yahoo Hadoop Summit Agenda is now posted—and you can register online.&lt;br /&gt;&lt;/a&gt;&amp;nbsp;&lt;br /&gt;&lt;strong&gt;Big News to Expect at the Hadoop Summit: We’ll Introduce Pervasive TurboRush™ for Hive&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;As more companies adopt Apache Hadoop software for big data preparation and analytics, many are using Apache Hive for its ability to run SQL-like queries and analyze large datasets stored in the Hadoop file system.&amp;nbsp; &lt;/p&gt;
&lt;p&gt;We saw the opportunity to provide a “turbocharger” to make Hive queries run more efficiently, without developers having to modify them or learn another tool. At the Hadoop Summit the Pervasive DataRush team will introduce &lt;a href="http://www.pervasivedatarush.com/Products/TurboRushforHive.aspx"&gt;Pervasive TurboRush for Hive.&lt;/a&gt;&amp;nbsp;Queries run faster on less hardware, all without code changes.&amp;nbsp;Stay tuned!&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;You’ll also hear news about the Pervasive DataRush, Community Edition for Hadoop&lt;/strong&gt; – a version of Pervasive DataRush for developers doing prototyping in Hadoop and looking to get better performance in their Map/Reduce cluster jobs. If you’re prototyping a Hadoop job and you’re sensitive to scaling and containing costs, this edition is for you. We’ll offer users a managed community site with forum, as well as support. &lt;br /&gt;&amp;nbsp;&lt;br /&gt;&lt;em&gt;&lt;strong&gt;And Don’t Forget:&lt;/strong&gt;&lt;/em&gt; &lt;strong&gt;BigDataCamp the night before the Hadoop Summit&lt;/strong&gt; &lt;/p&gt;
&lt;p&gt;The evening before Hadoop Summit 2011 users of Hadoop and other big data-related technologies will exchange ideas in a fast-paced, peer-driven “unconference.” BigDataCamp will be a knowledge transfer and networking opportunity for data engineers, enterprise architects, developers, analysts, data miners and business intelligence professionals. Led by &lt;a href="http://www.cloudcamp.org/"&gt;CloudCamp&amp;#39;s&lt;/a&gt; Dave Nielsen, attendees will share their thoughts in open discussions with pre-defined and majority-vote topics, including best practices in application development and advanced analytics. Pervasive DataRush Chief Technologist Jim Falgout will present a brief lightning talk, as will Concurrent Founder and CTO Chris Wensel, Foursquare Engineer Ben Lee and others.&amp;nbsp; &lt;/p&gt;
&lt;p&gt;Supported by Pervasive DataRush, Aster Data, Concurrent and EMC, BigDataCamp will be on June 28, 2011, from 5:30 PM-10:00 PM (PT), at the Network Meeting Center at Techmart. &lt;/p&gt;
&lt;p&gt;Tickets are going fast so you may want to &lt;a href="http://bigdatacamp-santaclara-2011.eventbrite.com/"&gt;register now.&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;br /&gt;&amp;nbsp;&lt;/p&gt;&lt;img src="http://cs.pervasive.com/aggbug.aspx?PostID=48064" width="1" height="1"&gt;</description><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/DataRush/default.aspx">DataRush</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Pervasive+DataRush/default.aspx">Pervasive DataRush</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/parallel+processing/default.aspx">parallel processing</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Datarush+team/default.aspx">Datarush team</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Pervasive/default.aspx">Pervasive</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/DataRush+engine/default.aspx">DataRush engine</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/parallelism/default.aspx">parallelism</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/parallelizing/default.aspx">parallelizing</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/parallelization/default.aspx">parallelization</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/big+data/default.aspx">big data</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/parallel/default.aspx">parallel</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Developers/default.aspx">Developers</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/parallel+programming/default.aspx">parallel programming</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Hadoop/default.aspx">Hadoop</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Big++Data/default.aspx">Big  Data</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Hadoop+Summit/default.aspx">Hadoop Summit</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Hive/default.aspx">Hive</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Pervasive+TurboRush+for+Hive/default.aspx">Pervasive TurboRush for Hive</category></item><item><title>Big Data Analytics Digest: Receive the latest news, thought leadership and Pervasive Big Data Analytics activity through RSS</title><link>http://cs.pervasive.com/blogs/datarush/archive/2011/06/17/big-data-analytics-digest-receive-the-latest-news-thought-leadership-and-pervasive-big-data-analytics-activity-through-rss.aspx</link><pubDate>Fri, 17 Jun 2011 22:12:00 GMT</pubDate><guid isPermaLink="false">3741b99c-ad24-4023-9eca-ddf558b8b674:48024</guid><dc:creator>Richard Maddox</dc:creator><slash:comments>0</slash:comments><description>&lt;p&gt;If you’ve explored our website lately, you may have noticed that we now offer a daily news aggregator called &lt;a href="http://www.pervasive.com/OnlineNewsDigest.aspx"&gt;Big Data Analytics Digest&lt;/a&gt;. It’s accessible on our&lt;a href="http://www.pervasivedatarush.com/"&gt; home page&lt;/a&gt;, too. &lt;/p&gt;
&lt;p&gt;We avidly follow the torrent of news and commentary in the realm of Big Data Analytics, so we rolled up our sleeves and created the digest to give our customers, industry watchers and analysts a glimpse of what we find interesting in the ever-changing Big Data Analytics scene—including breaking Hadoop-related news.&lt;/p&gt;
&lt;p&gt;So, if you subscribe, what will you get?&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Links to the latest Big Data news from the likes of BeyeNetwork, Computerworld, Enterprise Irregulars, Forrester, GigaOM, Infoworld, IT Business Edge, O’Reilly Radar, ReadWriteWeb, SDTimes, TechCrunch, The 451 Group and others.&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Thought leadership from the leading names in Big Data Analytics, including Pervasive DataRush Chief Technologist Jim Falgout and Pervasive CTO Mike Hoskins.&lt;/strong&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;(FYI – if you haven’t had a chance to read Jim’s widely read article on enhancing existing applications with embedded analytics in eWeek, &lt;a href="http://www.eweek.com/c/a/Enterprise-Applications/How-to-Enhance-Existing-Applications-with-Embedded-Analytics/"&gt;take a look&lt;/a&gt;.) &lt;/p&gt;
&lt;p&gt;&lt;strong&gt;A source for upcoming Big Data industry events and presentations &lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Technical resources&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Updates on the companies and products shaping Big Data Analytics&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Pervasive DataRush news PLUS updates from our Innovation Labs&amp;nbsp; &lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;In addition, we offer a comprehensive blogroll that will keep you in touch with the leading bloggers on the Big Data Analytics market. &lt;/p&gt;
&lt;p&gt;I’m open to all suggestions and criticisms to make the digest even better. Just drop me a line at &lt;a href="mailto:joe.dubin@pervasive.com"&gt;joe.dubin@pervasive.com&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;&lt;br /&gt;Joe Dubin, &lt;br /&gt;Pervasive DataRush Product Manager&lt;br /&gt;&lt;/p&gt;&lt;img src="http://cs.pervasive.com/aggbug.aspx?PostID=48024" width="1" height="1"&gt;</description></item><item><title>Pervasive DataRush Chief Technologist Jim Falgout to Speak at AMD Fusion Summit on June 14 </title><link>http://cs.pervasive.com/blogs/datarush/archive/2011/06/03/pervasive-datarush-chief-technologist-jim-falgout-to-speak-at-amd-fusion-summit-on-june-14.aspx</link><pubDate>Fri, 03 Jun 2011 17:20:00 GMT</pubDate><guid isPermaLink="false">3741b99c-ad24-4023-9eca-ddf558b8b674:47965</guid><dc:creator>Richard Maddox</dc:creator><slash:comments>0</slash:comments><description>&lt;p&gt;&lt;strong&gt;Jim Discusses Leveraging Multicore Systems for Hadoop and HPC Workloads&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Check out Pervasive DataRush Chief Technologist Jim Falgout at &lt;a href="http://developer.amd.com/afds/pages/default.aspx"&gt;the AMD Fusion Developer Summit&lt;/a&gt; June 13-16 at Meydenbauer Center in Bellevue, WA. The Summit includes more than 90 technology sessions across eight Technology Topics. AMD tech leaders, industry experts, and members of academia lead the sessions. &lt;/p&gt;
&lt;p&gt;Jim’s presentation, Session 1421 slated for Tuesday June 14th at 5:15 pm, will center on the critical importance of unlocking the parallelism of multi-core servers and clusters, particularly Hadoop-based clusters, to solve big data problems. Despite the promise of scaled-out hardware, users are encountering long processing times and complexities building MapReduce jobs. &lt;/p&gt;
&lt;p&gt;Jim argues that the right approach to solving these challenges is to better exploit the performance potential of multicore. His presentation includes examples of Hadoop and HPC workloads running on clusters and single multi-core servers—including web analytics and bioinformatics—detailing the performance and energy efficiency gains that are possible with scaling up, as well as out. &lt;/p&gt;
&lt;p&gt;&lt;a href="http://www.pervasivedatarush.com/"&gt;Learn more about the technology behind his approach.&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;As mentioned, the Summit covers eight technology topics including:&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Developer Tools:&lt;/strong&gt; Covers development tools ranging from compilers and debuggers to performance visualization tools. Sessions cover the state of the art in compiler technology (CPU and GPU), debugging and profiling OpenCL™, and automatic data movement.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Enterprise Computing:&lt;/strong&gt; Features sessions that discuss using multicore technology to handle large data, showcase software being developed today utilizing multicore CPUs, and show early work of applying the data parallel capabilities of GPUs to databases.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;High Performance Computing:&lt;/strong&gt; Presents a sampling of portable and standards based heterogeneous computing. Come see innovative uses of GPUs, extreme optimizations, power efficient implementations, benchmarks, libraries, and real world applications in physics, chemistry, finance and rendering.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Multimedia Processing:&lt;/strong&gt; Sessions on image processing, audio processing, video processing, telepresence, video quality enhancement, computer vision, transcoding, content recognition, image retrieval, multimedia algorithm optimization for parallel processing and codecs.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Professional Graphics and Visual Computing:&lt;/strong&gt; Focuses on various areas of visual computing, including mixed-mode OpenGL/DX/OpenCL™ interoperability, and advanced rendering and compute techniques. &lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Programming Models:&lt;/strong&gt; Showcases the state of the art in parallel programming models and techniques for heterogeneous platforms. Topics covered include: programming models for next generation GPU architectures and techniques for building domain specific languages on heterogeneous platforms.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Security:&lt;/strong&gt; Sessions on password recovery and audit, encryption, and steganography detection.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;User Interface and Media Experiences:&lt;/strong&gt; A focus on gesture recognition, touch recognition, face recognition, UIs for new user experiences, video management, video playback, and Web user experiences.&lt;/p&gt;
&lt;p&gt;There’s something for everyone. The Pervasive DataRush team hopes to see you at &lt;a href="http://developer.amd.com/afds/pages/default.aspx"&gt;the AMD Fusion Developer Summit! &lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;&lt;img src="http://cs.pervasive.com/aggbug.aspx?PostID=47965" width="1" height="1"&gt;</description><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/DataRush/default.aspx">DataRush</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Pervasive+DataRush/default.aspx">Pervasive DataRush</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/multicore+revolution/default.aspx">multicore revolution</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Multicore/default.aspx">Multicore</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/AMD/default.aspx">AMD</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/parallel+processing/default.aspx">parallel processing</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Datarush+team/default.aspx">Datarush team</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Pervasive/default.aspx">Pervasive</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/parallelism/default.aspx">parallelism</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/multicore+processors/default.aspx">multicore processors</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Pervasive+Software/default.aspx">Pervasive Software</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/big+data/default.aspx">big data</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/multi-threaded+applications/default.aspx">multi-threaded applications</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Hadoop/default.aspx">Hadoop</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/MapReduce/default.aspx">MapReduce</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Big++Data/default.aspx">Big  Data</category></item><item><title>A Detailed Summary of McKinsey Report on Big Data</title><link>http://cs.pervasive.com/blogs/datarush/archive/2011/06/02/a-detailed-summary-of-mckinsey-report-on-big-data.aspx</link><pubDate>Thu, 02 Jun 2011 17:14:00 GMT</pubDate><guid isPermaLink="false">3741b99c-ad24-4023-9eca-ddf558b8b674:47952</guid><dc:creator>Richard Maddox</dc:creator><slash:comments>5</slash:comments><description>&lt;p&gt;We wanted to share a detailed summary of &lt;a href="http://www.mckinsey.com/mgi/publications/big_data/index.asp"&gt;the report on Big Data by McKinsey Global Institute (MGI)&lt;/a&gt; that was released last month, as it contains relevant points and interesting statistics.&lt;br /&gt;&amp;nbsp;&lt;br /&gt;The report describes the state and growing role of digital data that has now entered every sector and economy, as well as the impact of the growing amount of data. MGI claims that there is strong evidence that Big Data can contribute significantly to national economies, creating substantial value for the overall world economy. Their research suggests that the public sector can increase its productivity through effective use of Big Data. For instance, the value to the US healthcare system could be $300 billion a year, and US retailers could boost their operating profit margins by 60 percent. However, MGI notes the challenges organizations face with reaching the full potential of Big Data, such as limited analytical and managerial talent to make big data advantageous and valuable for businesses.&lt;br /&gt;&amp;nbsp;&lt;br /&gt;Some relevant key findings from the research include:&lt;/p&gt;
&lt;p&gt;1.&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; MGI estimates the new data stored by enterprises exceeded 7 exabytes of data globally in 2010. In addition, the new data stored by customers around the world exceeded 6 exabytes in the same year.&lt;/p&gt;
&lt;p&gt;2.&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Organizations are using Big Data analytics more to make decisions by analyzing datasets, including from mobile and social networks, on customers, employees and sensors embedded in products. This is leading to innovation of new business models, products and services. For example, there will be a better match between products and customers.&lt;/p&gt;
&lt;p&gt;3.&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; The use of Big Data will encourage new growth opportunities and competition among businesses.&lt;/p&gt;
&lt;p&gt;4.&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Some sectors are positioned for greater gains from the use of Big Data. Those sectors include computer and electronic products, financial and insurance, and government. Public sectors, such as the education vertical, have experienced negative productivity growth due to high systemic barriers.&lt;/p&gt;
&lt;p&gt;5.&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Concerns over the use of Big Data include data policies, privacy issues, developing new techniques and technologies, organizational change and talent, access to and integration of information from various data sources, and industry structure.&lt;/p&gt;
&lt;p&gt;6.&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Under techniques for analyzing Big Data, MGI listed various methods from statistics and machine learning for data mining, including association rule learning, cluster analysis and classification. Data fusion and data integration are also significant techniques that allow analysis of data from multiple databases to extract valuable insight.&lt;/p&gt;
&lt;p&gt;7.&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp; Visualization supports Big Data and is becoming more important to portray information in a consumable way for people to understand.&lt;br /&gt;&amp;nbsp;&lt;br /&gt;Please see below for a direct link to the full report. The report includes an overview of Big Data techniques and technologies, the potential of Big Data and key findings in five types of verticals (health care, public sector administration, retail, manufacturing and personal location data), and the implications for organization leaders and policy makers.&lt;br /&gt;&amp;nbsp;&lt;br /&gt;McKinsey Global Institute, May 2011&lt;br /&gt;&lt;a href="http://www.mckinsey.com/mgi/publications/big_data/index.asp"&gt;Big data: The next frontier for innovation, competition, and productivity&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;&lt;img src="http://cs.pervasive.com/aggbug.aspx?PostID=47952" width="1" height="1"&gt;</description><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/DataRush/default.aspx">DataRush</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Predictive+Analytics/default.aspx">Predictive Analytics</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Pervasive+DataRush/default.aspx">Pervasive DataRush</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/DataRush+engine/default.aspx">DataRush engine</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/dataflow+model/default.aspx">dataflow model</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/datarush-analytics/default.aspx">datarush-analytics</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/data-intensive/default.aspx">data-intensive</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Hadoop/default.aspx">Hadoop</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Analytics/default.aspx">Analytics</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Big++Data/default.aspx">Big  Data</category></item><item><title>Tackling Big Data is a Big Job</title><link>http://cs.pervasive.com/blogs/datarush/archive/2011/05/10/tackling-big-data-is-a-big-job.aspx</link><pubDate>Tue, 10 May 2011 14:37:00 GMT</pubDate><guid isPermaLink="false">3741b99c-ad24-4023-9eca-ddf558b8b674:47766</guid><dc:creator>livey</dc:creator><slash:comments>0</slash:comments><description>&lt;font size="3"&gt;&lt;font face="Calibri"&gt;Tackling &lt;strong&gt;big data&lt;/strong&gt; is not a job that is only going to be solved by programmers alone.&lt;span style="mso-spacerun:yes;"&gt;&amp;nbsp; &lt;/span&gt;It’s going to be solved in concert with data scientists and analysts.&lt;span style="mso-spacerun:yes;"&gt;&amp;nbsp; &lt;/span&gt;What tools exist for both programmers and non-programmers?&lt;span style="mso-spacerun:yes;"&gt;&amp;nbsp;&amp;nbsp; &lt;/span&gt;We love KNIME for this and we love DataRush for KNIME even more.&lt;/font&gt;&lt;/font&gt;&lt;font size="3"&gt;&lt;font face="Calibri"&gt;&lt;/font&gt;&lt;/font&gt;&lt;font size="3" face="Calibri"&gt;&amp;nbsp;&lt;/font&gt;&lt;font size="3"&gt;&lt;font face="Calibri"&gt;&lt;a title="KNIME" href="http://www.knime.org/" target="_blank"&gt; &lt;br /&gt;&lt;br /&gt;KNIME&lt;/a&gt;, an open-source data mining and visualization tool, allows users to visually create data flows, execute selected analysis steps, and later investigate the results through interactive views on data and models.&lt;span style="mso-spacerun:yes;"&gt;&amp;nbsp; &lt;/span&gt;KNIME’s drag-and-drop interface gives non-programmers a chance to easily build work flows and store the analysis process for later expansion.&lt;span style="mso-spacerun:yes;"&gt;&amp;nbsp; &lt;/span&gt;&lt;a title="DataRush for KNIME" href="http://www.pervasivedatarush.com/Products/DataRushforKNIME.aspx" target="_blank"&gt;DataRush for KNIME&lt;/a&gt; allows big data analytics to be processed in a faction of the time without writing scripts.&lt;/font&gt;&lt;/font&gt; &lt;font size="3"&gt;&lt;font face="Calibri"&gt;&lt;br /&gt;&lt;br /&gt;This Wednesday, May 11, Pervasive DataRush and KNIME will host a &lt;a title="Scalable Data Analytics: Tools for Big Data Mining and Visualization" href="http://lp.pervasive.com/PDR-KNIME-Webex-May2011.html" target="_blank"&gt;joint webinar&lt;/a&gt;, &lt;strong&gt;Scalable Data Analytics: Tools for&amp;nbsp;Big Data Mining and Visualization&lt;/strong&gt;, &amp;nbsp;that will show &lt;strong&gt;KNIME’s intuitive user interface&lt;/strong&gt; and demo &lt;strong&gt;how&lt;/strong&gt; &lt;strong&gt;DataRush accelerates KNIME’s already powerful data mining capabilities.&lt;/strong&gt;&lt;span style="mso-spacerun:yes;"&gt;&lt;strong&gt;&amp;nbsp;&lt;/strong&gt; &lt;/span&gt;Michael Berthold, CEO of KNIME, will show interactive analytics and visualizations produced with the KNIME platform.&lt;span style="mso-spacerun:yes;"&gt;&amp;nbsp; &lt;/span&gt;With DataRush for KNIME, Davin Potts will show how anyone who can use a mouse can now use DataRush to maximize the full power of cores in their server without needing to be a programmer.&lt;/font&gt;&lt;/font&gt;&lt;font size="3" face="Calibri"&gt;&amp;nbsp;&lt;/font&gt;&lt;font size="3"&gt;&lt;font face="Calibri"&gt;&lt;br /&gt;&lt;br /&gt;Making available the power of DataRush through a tool like KNIME means empowering data scientists, analysts, and programmers alike to help tackle big data analytics.&lt;span style="mso-spacerun:yes;"&gt;&amp;nbsp; &lt;/span&gt;If you miss the webinar, please &lt;a title="Email Us" href="mailto:%20info@pervasivedatarush.com" target="_blank"&gt;email us&lt;/a&gt; for a recorded version of the live event.&lt;/font&gt;&lt;/font&gt; &lt;/font&gt;&lt;/font&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;&lt;img src="http://cs.pervasive.com/aggbug.aspx?PostID=47766" width="1" height="1"&gt;</description><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/DataRush/default.aspx">DataRush</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/data+mining/default.aspx">data mining</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/KNIME/default.aspx">KNIME</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Analytics/default.aspx">Analytics</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Big++Data/default.aspx">Big  Data</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/webinar/default.aspx">webinar</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/programmers/default.aspx">programmers</category></item><item><title>Hitting the East Coast this week:  GigaOM Structure Big Data 2011</title><link>http://cs.pervasive.com/blogs/datarush/archive/2011/03/21/hitting-the-east-coast-this-week-gigaom-structure-big-data-2011.aspx</link><pubDate>Mon, 21 Mar 2011 15:12:00 GMT</pubDate><guid isPermaLink="false">3741b99c-ad24-4023-9eca-ddf558b8b674:47265</guid><dc:creator>livey</dc:creator><slash:comments>0</slash:comments><description>&lt;p&gt;&lt;font size="2" face="Arial"&gt;Pervasive Software is a proud sponsor and exhibitor of &lt;a href="http://event.gigaom.com/bigdata/" target="_blank"&gt;GigaOM’s Structure Big Data 2011&lt;/a&gt; taking place on March 23 in New York City.&amp;nbsp; As noted on their website, The Structure Big Data conference is designed to get you up to speed on how to make money using the data already locked in your organization. &lt;/font&gt;&lt;/p&gt;
&lt;p&gt;&lt;font size="2" face="Arial"&gt;In our booth at GigaOM Structure Big Data, we’ll be showing the latest release of &lt;a href="http://www.pervasivedatarush.com/" target="_blank"&gt;Pervasive DataRush&lt;/a&gt;,&amp;nbsp;a software framework for building high-performance applications for big data.&amp;nbsp;&amp;nbsp; Organizations can develop big data applications that run faster, cost less, and use less energy – whether on a cluster or a single server, on premises or in the cloud.&amp;nbsp; Check out this &lt;a href="http://www.youtube.com/watch?v=LtmpkSN3pZc&amp;amp;mkt_tok=3RkMMJWWfF9wsRow5%2FmYJoDpwmWGd5mht7VzDtPj1OY6hBouIbmJK1TtuMFUGpsqOPmbExQRAJl3xQ%3D%3D" target="_blank"&gt;video&lt;/a&gt; of Jim Falgout discussing the latest version of Pervasive DataRush and where it can be used.&amp;nbsp; Not only are we exhibiting, but our CTO and general manager of Integration Products, Mike Hoskins, will participate on the panel titled, “The Many Faces of Map Reduce - Hadoop and Beyond,” taking place at 1:30 p.m. EST.&amp;nbsp; Mike will be discussing parallelism and big data.&amp;nbsp; Mike is always atop the latest industry trends and will be available to meet and discuss any questions you may have.&amp;nbsp;&amp;nbsp; If you’re not already registered and would like to attend,&amp;nbsp; you can &lt;a href="http://mkto-h0008.com/track?type=click&amp;amp;enid=bWFpbGluZ2lkPXBlcnZhc2l2ZUJldGFjdXN0LS0tLTIwMzUtcHJvZC0xMDQxMiZtZXNzYWdlaWQ9MCZkYXRhYmFzZWlkPTEwNDEyJnNlcmlhbD0xMjY0NDUyNzEwJmVtYWlsaWQ9bGl2ZXlAcGVydmFzaXZlLmNvbSZ1c2VyaWQ9MCZleHRyYT0mJiY=&amp;amp;&amp;amp;&amp;amp;http://bigdata2011.eventbrite.com/?discount=SPEAKFNF&amp;amp;mkt_tok=3RkMMJWWfF9wsRow5%2FmYJoDpwmWGd5mht7VzDtPj1OY6hBouIbmJK1TtuMFUGpsqOPmbExQRAJl3xQ%3D%3D" target="_blank"&gt;save $100 off registration by using this link&lt;/a&gt;.&amp;nbsp;&amp;nbsp; &lt;/font&gt;&lt;/p&gt;
&lt;p&gt;&lt;font size="2" face="Arial"&gt;GigaOM is sure to be a success and we hope to see you there! &lt;/font&gt;&lt;/p&gt;
&lt;p&gt;&amp;nbsp;&lt;/p&gt;&lt;img src="http://cs.pervasive.com/aggbug.aspx?PostID=47265" width="1" height="1"&gt;</description><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Pervasive/default.aspx">Pervasive</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Big++Data/default.aspx">Big  Data</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Mike+Hoskins/default.aspx">Mike Hoskins</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Structure+Big+Data/default.aspx">Structure Big Data</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/GigaOM/default.aspx">GigaOM</category></item><item><title>Takeaways from the O'Reilly Strata Conference   </title><link>http://cs.pervasive.com/blogs/datarush/archive/2011/02/18/takeaways-from-the-o-reilly-strata-conference.aspx</link><pubDate>Fri, 18 Feb 2011 21:09:00 GMT</pubDate><guid isPermaLink="false">3741b99c-ad24-4023-9eca-ddf558b8b674:46960</guid><dc:creator>Richard Maddox</dc:creator><slash:comments>0</slash:comments><description>&lt;p&gt;Attending the O&amp;#39;Reilly Strata Conference, I received lots of food for thought about the future of Big Data, as well as further validation that Pervasive DataRush&lt;sup&gt;TM&lt;/sup&gt; is a good framework&lt;i&gt; &lt;/i&gt;to respond to many of the information explosion challenges now or soon to be facing organizations. Here are some of my takeaways from this insightful event.&lt;/p&gt;
&lt;p&gt;Joe Dubin&lt;br /&gt;Manager, Product Marketing&lt;br /&gt;Pervasive DataRush&lt;/p&gt;
&lt;p&gt;&lt;b&gt;Information is Black Gold&lt;/b&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="http://www.metamarketsgroup.com/team.php"&gt;Metamarkets&lt;/a&gt; CTO Mike Driscoll told technology executives to think ‘oil&amp;#39; when it comes to information. Driscoll, quoting Gartner&amp;#39;s Peter Sondergaard, stresses, &amp;quot;Information will be the ‘oil of the 21st century&amp;#39;. It will be the resource running our economy in ways not possible in the past.&amp;quot; Driscoll describes Big Data as the ‘tar sands&amp;#39; of the information economy, containing valuable stores of information, but that are expensive to extract. Once extracted, the challenge is to analyze the data, using it to learn and predict.&amp;nbsp; &lt;/p&gt;
&lt;p&gt;Driscoll sees three major forces driving Big Data: &lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Ubiquitous sensor networks&lt;/strong&gt; (mobile phones, as an example).&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Cloud computing obviating the need to manage compute power.&lt;/strong&gt; Drawing an analogy to an electric grid, Driscoll says that businesses don&amp;#39;t invest capital in power generation, and the cloud enables a similar trend for compute power.&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Machine learning&lt;/strong&gt;, with Driscoll citing the progress made in the &lt;a href="http://en.wikipedia.org/wiki/DARPA_Grand_Challenge"&gt;DARPA grand challenge&lt;/a&gt; and the Netflix prize.&lt;/li&gt;&lt;/ul&gt;
&lt;p&gt;Other emerging trends Driscoll has a pulse on are:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;&lt;b&gt;The Need for Data Scientists:&lt;/b&gt; Already in short supply, the demand for data scientists is growing. Companies are looking for those with interdisciplinary skills in math, statistics, bioinformatics, physics, programming (and hacking) skills, and, above all, curiosity. In fact, many speakers ended their presentations with a message to data scientists: &lt;b&gt;&lt;i&gt;We&amp;#39;re hiring&lt;/i&gt;&lt;/b&gt;. &lt;/li&gt;
&lt;li&gt;&lt;b&gt;The Rise of Data Publishers (i.e., the&lt;strong&gt; reassertion of control by data producers)&lt;/strong&gt;&lt;/b&gt;: Companies recognize the value of their own data and are pulling back from third-party data processors. &lt;/li&gt;
&lt;li&gt;&lt;b&gt;The End of Privacy (or the Rethinking of Privacy):&lt;/b&gt; The view that visibility of personal data can be restricted is shifting to one inclined to restricting allowed usage of that data. In other words, policing usage will become more prevalent.&lt;/li&gt;
&lt;li&gt;&lt;b&gt;The Rise of Data Start-ups:&lt;/b&gt; A class of companies is emerging whose supply chains consist of nothing but data. Their inputs are collected through partnerships or from publicly available sources, processed, and transformed into traffic predictions, news aggregations, or real estate valuations. &lt;b&gt;Data start-ups are the wildcatters of the information age,&lt;/b&gt; searching for opportunities across the data landscape.&lt;/li&gt;&lt;/ul&gt;
&lt;p&gt;Data science, Driscoll firmly believes, can solve big problems to organizations-namely, making sense of the world and scaling-up decision making.&amp;nbsp; As a case in point, he cites the use of &lt;a href="http://www.newyorker.com/reporting/2011/01/24/110124fa_fact_gawande?currentPage=all"&gt;data mining to reduce health care costs by identifying the neediest patients and improving their health care&lt;/a&gt;. &lt;/p&gt;
&lt;p&gt;&lt;a href="http://www.dataspora.com/blog/mining-the-tar-sands-of-big-data/#more-121"&gt;Read more of Driscoll&amp;#39;s commentary.&lt;/a&gt; &lt;/p&gt;
&lt;p&gt;&lt;b&gt;Traditional BI and Applications are Complimentary&lt;/b&gt;&lt;/p&gt;
&lt;p&gt;Dr. Barry Devlin of &lt;a href="http://www.9sight.com/"&gt;9sight Consulting&lt;/a&gt;, an industry founder of the DW industry with over 30 years in DW and BI, suggests that &lt;b&gt;Traditional BI&lt;/b&gt; (and its database-centric approach), with its emphasis on consistency, traceability, and data quality, and &lt;b&gt;Applications&lt;/b&gt;, built on technologies like Hadoop and MapReduce, with support for large, rapidly changing datasets, are complementary approaches to handling the information explosion organizations face. &lt;/p&gt;
&lt;p&gt;I found it interesting that Dr. Devlin presented Traditional BI and Applications as two worlds which need to-and can-work together. Our product &lt;a href="http://www.pervasivedatarush.com/"&gt;Pervasive DataRush&lt;/a&gt; works in both spaces-on the traditional BI side, it&amp;#39;s the basis for our Pervasive DataMatcher and Pervasive Data Profiler products and for applications, Pervasive DataRush integrates with Hadoop, accelerating MapReduce jobs up to 10x.&lt;/p&gt;
&lt;p&gt;&lt;a href="http://assets.en.oreilly.com/1/event/55/The%20Data-driven%20Business%20and%20Other%20Lessons%20from%20History%20Presentation.pdf"&gt;Take a look at Dr. Devlin&amp;#39;s slide presentation.&lt;/a&gt;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;The Multicore Crisis and Emerging Technologies &lt;/b&gt;&lt;br /&gt;One of the highlights of Strata was listening to &lt;a href="http://thirdnature.net/about_us.html"&gt;Third Nature&lt;/a&gt; President Mark Madsen&amp;#39;s survey of new technologies, particularly technology innovations and systems powering the analytic database landscape today. Madsen underscored the multicore crisis and the end of Moore&amp;#39;s law free lunch as major factors in shaping data technology.&amp;nbsp; &lt;/p&gt;
&lt;p&gt;Pre-2005, Madsen says, the trend was for CPU manufacturers to increase clock rates with every new chip, and everyone&amp;#39;s software would automatically run faster. But increasing speed also increases power consumption and heat generated. As a remedy, CPU makers moved towards putting multiple cores on a chip. Madsen, however, points out, &amp;quot;Putting more engines in your car doesn&amp;#39;t make it go faster; you need to redesign to take advantage of them. Achieving multicore performance is fundamentally different than getting a free boost from clock rates increasing.&amp;quot;&amp;nbsp; I couldn&amp;#39;t agree more!&lt;/p&gt;
&lt;p&gt;Companies operating at petabyte scale like Google, EBay and Twitter are the exception to the norm, Madsen states. Most companies in need of Big Data Analytics have less than six terabytes of data, and he finds that the computational needs of data analytics are pushing companies from running on PCs into SMP servers and clusters. &lt;b&gt;I would add that &lt;/b&gt;&lt;a href="http://www.pervasivedatarush.com/Solutions/DataRushforAnalytics.aspx"&gt;&lt;b&gt;Pervasive DataRush&lt;/b&gt;&lt;/a&gt;&lt;b&gt; can help here.&lt;/b&gt; &lt;/p&gt;
&lt;p&gt;Mark was throwing out insights faster than I could write them down... thankfully,&lt;a href="http://assets.en.oreilly.com/1/event/55/Determine%20the%20Right%20Analytic%20Database_%20A%20Survey%20of%20Data%20Technologies%20and%20Products%20Presentation%201.pdf"&gt; his slides&lt;/a&gt; are available. I recommend taking a more in-depth look. I know I will.&lt;/p&gt;&lt;img src="http://cs.pervasive.com/aggbug.aspx?PostID=46960" width="1" height="1"&gt;</description><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Pervasive+DataRush/default.aspx">Pervasive DataRush</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/data+mining/default.aspx">data mining</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Pervasive+Software/default.aspx">Pervasive Software</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Hadoop/default.aspx">Hadoop</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/MetaMarkets/default.aspx">MetaMarkets</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Data+Scientists/default.aspx">Data Scientists</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Data+Publishers/default.aspx">Data Publishers</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Mike+Dricoll/default.aspx">Mike Dricoll</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/O_2700_Reilly+Strata+Conference+2011/default.aspx">O'Reilly Strata Conference 2011</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Data++Start-ups/default.aspx">Data  Start-ups</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Multicore+Crisis/default.aspx">Multicore Crisis</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Third+Nature/default.aspx">Third Nature</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Barry+Devlin/default.aspx">Barry Devlin</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Pervasive+Data+Profiler/default.aspx">Pervasive Data Profiler</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Pervasive+DataMatcher/default.aspx">Pervasive DataMatcher</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Mark+Madsen/default.aspx">Mark Madsen</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Joe+Dubin/default.aspx">Joe Dubin</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/MapReduce/default.aspx">MapReduce</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Multicore+Performance/default.aspx">Multicore Performance</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/9Sight+Consulting/default.aspx">9Sight Consulting</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Big++Data/default.aspx">Big  Data</category></item><item><title>Pervasive DataRush, Parallelism, Big Data and Hadoop are Top of Mind in Upcoming Pervasive Presentations </title><link>http://cs.pervasive.com/blogs/datarush/archive/2011/01/28/pervasive-datarush-parallelism-big-data-and-hadoop-are-top-of-mind-in-upcoming-pervasive-presentations.aspx</link><pubDate>Fri, 28 Jan 2011 23:07:00 GMT</pubDate><guid isPermaLink="false">3741b99c-ad24-4023-9eca-ddf558b8b674:46698</guid><dc:creator>Richard Maddox</dc:creator><slash:comments>0</slash:comments><description>&lt;p&gt;January, February and March will be busy months for our technology evangelists. Pervasive DataRush Director of Product Management Davin Potts, Pervasive DataRush Chief Technologist Jim Falgout and Pervasive Software Chief Technology Officer Mike Hoskins will be speaking at major conferences over the next few weeks.&lt;/p&gt;

&lt;p&gt;At the O’Reilly Strata Conference (www.strataconf.com/strata2011) in San Francisco, running from February 1-3, Davin’’s “Supercharge Development and Performance of Hadoop Applications” presentation will take place on February 2 at 2:30 p.m. PST in the Mission City B4 room. He will discuss how to get more done faster with high-performance MapReduce and expand the universe of Hadoop possibilities with tools to speed and simplify development and deployment of analytic applications, such as Pervasive DataRush™. Pervasive Software also is exhibiting at the Strata event—you can find us at booth 501.&lt;/p&gt;

&lt;p&gt;For those in the Austin area, on January 29, Davin also will be speaking at DataDay Austin (www.datadayaustin.eventbrite.com). He will discuss “Reducing Complexity and Increasing Efficiency in the Land of Hadoop.” We hope you can join him.&lt;/p&gt;

&lt;p&gt;Pervasive Chief Technologist Jim Falgout will be addressing the European Data Innovation Summit (www.europeanintegrationsummit2011.com) audience in London at the Hilton London Tower Bridge on February 2. Jim’s presentation will be “Best Practices in Building Custom Applications for Big Data.” He will describe how the dataflow-based Pervasive DataRush framework delivers massive built-in parallelism to power applications that automatically scale up to consume the full capacity of commodity multicore servers.&lt;/p&gt;

&lt;p&gt;Jim also will be speaking at the KNIME User  Group Meeting (www.knime.org/about/events/ugm-workshop-2011), being held in Zurich on February 28-March 4. Stay tuned for more details.&lt;/p&gt;


At the GigaOM Structure Big Data conference (http://event.gigaom.com/bigdata/) on March 23 at Pier Sixty in New York City, Mike Hoskins will be part of the Hadoop panel that will be gathering at 1:30 p.m. Mike will be discussing parallelism and Big Data. Mike is always a font of industry knowledge and an astute trend predictor (and entertaining to boot).&lt;/p&gt; 


We look forward to seeing you—no matter the time zone--in the weeks ahead!
&lt;img src="http://cs.pervasive.com/aggbug.aspx?PostID=46698" width="1" height="1"&gt;</description></item><item><title>Intel’s Concurrency Checker confirms powerful scalability of Pervasive DataRush &amp; Pervasive DataMatcher </title><link>http://cs.pervasive.com/blogs/datarush/archive/2011/01/17/intel-s-concurrency-checker-confirms-powerful-scalability-of-pervasive-datarush-amp-pervasive-datamatcher.aspx</link><pubDate>Mon, 17 Jan 2011 23:17:00 GMT</pubDate><guid isPermaLink="false">3741b99c-ad24-4023-9eca-ddf558b8b674:46572</guid><dc:creator>Richard Maddox</dc:creator><slash:comments>0</slash:comments><description>&lt;p&gt;The Intel Concurrency Checker can be used to evaluate the performance scaling of applications on multi-core systems and to help further optimize applications. It’s a tool that is used to check application threading and threading concurrency and can also be utilized to measure performance by running the application before and after making specific code enhancements and comparing the measured results. &lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Computed Scaling&lt;br /&gt;&lt;/strong&gt;Computed scaling, or concurrency level, is Intel’s measure to predict the performance improvement factor when an application is run on a multi-core system, compared to a single-core system. The concurrency level is measured over a 30-second interval.&amp;nbsp; Pervasive Software recently used the &lt;a href="http://swpartner.intel.com/Partner/Initiatives/SATCenter.aspx?TFACode=MC&amp;amp;STEP=Download&amp;amp;Lang=ENG&amp;amp;cid=sw:PP_Community_IC_Checker_Banner_1" target="_blank"&gt;Concurrency Checker&lt;/a&gt; to conduct software performance testing of&lt;a href="http://www.pervasivedatarush.com/Pages/default.aspx" target="_blank"&gt; Pervasive DataRush v4.4&lt;/a&gt; running the MalStone B benchmark and &lt;a href="http://www.pervasivedatarush.com/products/Pages/DataMatcher.aspx" target="_blank"&gt;Pervasive DataMatcher 5.0&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Pervasive DataRush 4.4 running the MalStone B Benchmark&lt;br /&gt;&lt;/strong&gt;&lt;a href="http://opencloudconsortium.org/benchmarks/" target="_blank"&gt;MalStone benchmarks&lt;/a&gt;, developed by the &lt;a href="http://opencloudconsortium.org/" target="_blank"&gt;Open Cloud Consortium&lt;/a&gt;, provide a method for assessing data-intensive application performance for cloud-based clusters. MalStone datasets consist of information about web site visits and cyber infection status. The benchmarks calculate the rate of infection for each site (an anomaly that might signal intrusions or attempted intrusions).&amp;nbsp; &lt;/p&gt;
&lt;p&gt;Largely a ‘Read’ operation, the result shows Pervasive DataRush’s ability to tackle I/O intensive data activities with phenomenal throughput.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;&lt;em&gt;System Information:&lt;br /&gt;&lt;/em&gt;&lt;/strong&gt;Cores: 16&lt;br /&gt;Processor:&amp;nbsp; 164 Family 6 Model 15 Stepping 11 Intel Xeon CPU E7330 at 2.4GHz&lt;br /&gt;Operating System:&amp;nbsp; Windows Server 2008 R2 Standard Edition (build 7600), 64-bit&lt;br /&gt;Sockets: 4&lt;br /&gt;Logicals: 16&lt;br /&gt;&amp;nbsp;&lt;br /&gt;&lt;strong&gt;Pervasive DataMatcher 5.0&lt;br /&gt;&lt;/strong&gt;On the same 16-core system, Pervasive DataMatcher 5.0’s computed measured value was amazing. &lt;/p&gt;
&lt;p&gt;The test serves as another proof point about CPU- and process-intensive capabilities of Pervasive DataMatcher, which allows users to fully utilize all of the capacity of their multicore systems.&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;&lt;em&gt;System Information:&lt;br /&gt;&lt;/em&gt;&lt;/strong&gt;Cores: 16&lt;br /&gt;Processor:&amp;nbsp; 164 Family 6 Model 15 Stepping 11 Intel Xeon CPU E7330 at 2.4GHz&lt;br /&gt;Operating System:&amp;nbsp; Windows Server 2008 R2 Standard Edition (build 7600), 64-bit&lt;br /&gt;Sockets: 4&lt;br /&gt;Logicals: 16&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;&lt;u&gt;Intel Case Study&lt;/u&gt;&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Pervasive’s testing results further validate the computed scaling performance of our highly robust Pervasive DataRush-based products. Candidly (and no pun intended), our results were off the scale. The results, even more, serve as additional substantiation that the multicore reality is here and now. &lt;/p&gt;
&lt;p&gt;As a developer or other professional seeking software capable of tackling big data challenges, we encourage you to check out &lt;a href="http://login.pervasive.com/access/index?assetkey=DataRushTrialRequest&amp;amp;asseturi=http://www.pervasivedatarush.com/pages/trialconfirmation.aspx&amp;amp;lid=PDR&amp;amp;form=wf-0002" target="_blank"&gt;Pervasive DataRush&lt;/a&gt; yourself. The Pervasive DataRush team also invites you to read Intel’s case study of our use of the Concurrency Checker. Look for the case study soon.&amp;nbsp; &lt;/p&gt;
&lt;p&gt;&lt;br /&gt;&lt;em&gt;*Based on 30-second elapsed time.&lt;/em&gt;&lt;/p&gt;&lt;img src="http://cs.pervasive.com/aggbug.aspx?PostID=46572" width="1" height="1"&gt;</description><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/DataRush/default.aspx">DataRush</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Pervasive+DataRush/default.aspx">Pervasive DataRush</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/multicore+revolution/default.aspx">multicore revolution</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/parallel+processing/default.aspx">parallel processing</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Pervasive/default.aspx">Pervasive</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/DataRush+engine/default.aspx">DataRush engine</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/parallelism/default.aspx">parallelism</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/intel/default.aspx">intel</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/parallelizing/default.aspx">parallelizing</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/runtime+scalability/default.aspx">runtime scalability</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/DataRush+applications/default.aspx">DataRush applications</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/parallelization/default.aspx">parallelization</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/multicore+processors/default.aspx">multicore processors</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Pervasive+Software/default.aspx">Pervasive Software</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/multicore+challenges/default.aspx">multicore challenges</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/multicore+hardware/default.aspx">multicore hardware</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Scalability/default.aspx">Scalability</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/data-intensive/default.aspx">data-intensive</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/MalStone+B/default.aspx">MalStone B</category></item><item><title>Fully exploit your servers to meet analytic challenges on growing data sets</title><link>http://cs.pervasive.com/blogs/datarush/archive/2010/12/06/fully-exploit-your-servers-to-meet-analytic-challenges-on-growing-data-sets.aspx</link><pubDate>Mon, 06 Dec 2010 19:04:00 GMT</pubDate><guid isPermaLink="false">3741b99c-ad24-4023-9eca-ddf558b8b674:46155</guid><dc:creator>Richard Maddox</dc:creator><slash:comments>0</slash:comments><description>&lt;h2&gt;We’ll show you how on Dec. 8 &lt;/h2&gt;

&lt;p&gt;Like most software organizations yours probably needs a cost-effective approach to deliver analytics or other data-intensive solutions amid increasing data volumes and growing processing complexity—one that allows you to enhance your ability to gain maximum performance from existing servers for data-intensive jobs. &lt;/p&gt;


&lt;h3&gt;Scaling performance, of course, will be a key to your success. How can you get it? &lt;/h3&gt;

 &lt;p&gt;Multicore processors from Intel, AMD and IBM redefine the compute density and capabilities of commodity servers. Now, rather than relying on endless scale-outs, you can scale up within every server in your cluster and achieve radical throughput gains when you prep, process and analyze Big Data. &lt;/p&gt;


&lt;p&gt;On December 8 at 12:00 p.m. CST/1:00 p.m. EST/11:00 a.m. PST, join us for an instructive webinar presented by Pervasive Software CTO Mike Hoskins and Pervasive DataRush Chief Technologist Jim Falgout to learn how to quickly build high-throughput analytics software that will give you eye-popping performance on multicore servers.   &lt;/p&gt;


&lt;h3&gt;Mike and Jim will detail how to: &lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;&amp;raquo; Deliver 20+ times the throughput at a fraction of the cost of deploying or expanding a cluster&lt;/li&gt;
&lt;li&gt;&amp;raquo; Quickly build your analytic models using a visual open source tool&lt;/li&gt;
&lt;li&gt;&amp;raquo; Avoid the multi-pass requirements of SQL and deliver high-throughput custom analytics&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Don’t miss it. &lt;/p&gt;


&lt;a style="color:white;padding:4px;font-size:large;background:black;text-decoration:none;display:block;width:300px;text-align:center;-moz-border-radius:5px;border-radius:5px;" href="http://www.pervasivedatarush.com/Pages/Webinar_DeliveringCustomAnalyticsthatKeepUpWithYourData.aspx"&gt;Register Now&lt;/a&gt;




&lt;img src="http://cs.pervasive.com/aggbug.aspx?PostID=46155" width="1" height="1"&gt;</description><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/DataRush/default.aspx">DataRush</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Pervasive+DataRush/default.aspx">Pervasive DataRush</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Jim+Falgout/default.aspx">Jim Falgout</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/multicore+revolution/default.aspx">multicore revolution</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Multicore/default.aspx">Multicore</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/AMD/default.aspx">AMD</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/parallel+processing/default.aspx">parallel processing</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/density-based+clustering/default.aspx">density-based clustering</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/clustering/default.aspx">clustering</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/dataflow/default.aspx">dataflow</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Pervasive/default.aspx">Pervasive</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/DataRush+engine/default.aspx">DataRush engine</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/parallelism/default.aspx">parallelism</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/intel/default.aspx">intel</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/parallelizing/default.aspx">parallelizing</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/runtime+scalability/default.aspx">runtime scalability</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/DataRush+applications/default.aspx">DataRush applications</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/scalable/default.aspx">scalable</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/parallelization/default.aspx">parallelization</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/multicore+processors/default.aspx">multicore processors</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/data-intensive+applications/default.aspx">data-intensive applications</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Pervasive+Software/default.aspx">Pervasive Software</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/datarush-analytics/default.aspx">datarush-analytics</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/big+data/default.aspx">big data</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/parallel/default.aspx">parallel</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/parallel+programming/default.aspx">parallel programming</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/multicore+challenges/default.aspx">multicore challenges</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/multicore+hardware/default.aspx">multicore hardware</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Scalability/default.aspx">Scalability</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/data-intensive/default.aspx">data-intensive</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/MalStone+B/default.aspx">MalStone B</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Hadoop/default.aspx">Hadoop</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Analytics/default.aspx">Analytics</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/density+based+clustering/default.aspx">density based clustering</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/distributed+clustering/default.aspx">distributed clustering</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/valuable+information/default.aspx">valuable information</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/dense+computing/default.aspx">dense computing</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/clusters/default.aspx">clusters</category></item><item><title>You have my permission</title><link>http://cs.pervasive.com/blogs/talking_out_cloud/archive/2010/10/21/you-have-my-permission.aspx</link><pubDate>Thu, 21 Oct 2010 15:37:00 GMT</pubDate><guid isPermaLink="false">3741b99c-ad24-4023-9eca-ddf558b8b674:45678</guid><dc:creator>bjacaruso</dc:creator><slash:comments>0</slash:comments><description>&lt;p&gt;I met a software vendor recently who was describing his product and he said, &amp;quot;We are a real cloud app, not like Google mail or something like that.&amp;quot; What? I guess there is an argument in the software world as to what qualifies as a &amp;#39;real&amp;#39; cloud app and what is not a &amp;#39;real&amp;#39; cloud app.&lt;/p&gt;
&lt;p&gt;That sounds funny to me. It seems like if your application leverages cloud computing, then it is a cloud application. If your application resides on your desktop, then your application is a not a cloud application.&lt;/p&gt;
&lt;p&gt;My acquaintance built a sophisticated business process management full SaaS application that ran on Microsoft&amp;#39;s Azure platform. I guess he felt insulted when people tried to compare him to, of all things, Gmail!!! Go figure.&lt;/p&gt;
&lt;p&gt;Gmail is quite impressive to me and certainly seems like a cloud app. Because of this conversation I have torn down the wall in my head. No longer do I try to classify a cloud app by its complexity, size or even where its code runs.&lt;/p&gt;
&lt;p&gt;I now say that if an application leverages cloud computing elements, then you have my permission to call it a cloud application. 
&lt;p&gt;&amp;nbsp;&lt;/p&gt;&lt;img src="http://cs.pervasive.com/aggbug.aspx?PostID=45678" width="1" height="1"&gt;</description></item><item><title>Cloud Computing Solves Cost, Time and Expertise Puzzle</title><link>http://cs.pervasive.com/blogs/talking_out_cloud/archive/2010/10/15/cloud-computing-solves-cost-time-and-expertise-puzzle.aspx</link><pubDate>Fri, 15 Oct 2010 06:31:00 GMT</pubDate><guid isPermaLink="false">3741b99c-ad24-4023-9eca-ddf558b8b674:45601</guid><dc:creator>bjacaruso</dc:creator><slash:comments>1</slash:comments><description>
&lt;p&gt;I was speaking with a colleague yesterday about our cloud offerings.&amp;nbsp; We were discussing people who grew up in the on premise world and wanted to know what the big deal was. I realized I could have gone on for hours about the advantages of cloud, but I wanted to keep it short.&amp;nbsp; I came up with the core business area cloud improves on is IT infrastructure in the form of cost, time and expertise. Let me explain each of those.&lt;/p&gt;

&lt;p&gt;Cost advantages in the cloud are twofold.&amp;nbsp; First because of economies of scale it is simply less expensive in terms of server to server comparisons.&amp;nbsp; No matter what the sales guy at Dell or HP tells you, he cannot make it cheaper for you to own a Windows server for a month (once you roll in the underlying costs of maintenance). The second dimension of cost advantage is that my resources are now disposable.&amp;nbsp; When I am done with it, I can get rid of it.&amp;nbsp; I don&amp;#39;t even need someone to come pick it up and cart it off.&lt;/p&gt;


&lt;p&gt;Time advantages relate more to the procurement cycle. If I need a new server, load balancer, or more storage in the cloud, I make an API call.&amp;nbsp; If I need more of anything in my own data center I have to make a telephone call to a sales guy. I wonder which is faster?&lt;br /&gt;&amp;nbsp;
&lt;br /&gt;
Finally: expertise.&amp;nbsp; Granted that in any technology based infrastructure I will need experts, will I still need the guy whose sole job is to provision, update and build servers?&amp;nbsp; Probably not.&amp;nbsp; Cloud IaaS vendors are now even supplying a collection of preconfigured server / software packages ready to go as off the shelf images.&amp;nbsp; Think about getting a hardware based server, installing both an application server and database ready to go in your shop.&amp;nbsp; How many different people does it take?&amp;nbsp; On the cloud I can do it by the simple click of a button!&amp;nbsp;&lt;br /&gt;
&lt;br /&gt;
Cloud, when compared to an onsite IT data center really does have significant advantages.&amp;nbsp; When making the decision to cloud or not to cloud, be sure to take in the elements of cost, time and expertise.&amp;nbsp;&lt;/p&gt;
&lt;img src="http://cs.pervasive.com/aggbug.aspx?PostID=45601" width="1" height="1"&gt;</description></item><item><title>Update Available:  DataRush 4.4.1 </title><link>http://cs.pervasive.com/blogs/datarush/archive/2010/10/06/datarush-4-4-1.aspx</link><pubDate>Wed, 06 Oct 2010 13:00:00 GMT</pubDate><guid isPermaLink="false">3741b99c-ad24-4023-9eca-ddf558b8b674:45472</guid><dc:creator>azeemj</dc:creator><slash:comments>0</slash:comments><description>&lt;p&gt;We&amp;#39;ve been working hard here at Pervasive and DataRush 4.4.1 is now available for download. Don&amp;#39;t let the small version number increment fool you, there are many worthwhile changes to DataRush. Including new documentation that shows how easy it is to use the DataRush analytics packages.&amp;nbsp; The new and updated documentation are easier to read and are more complete than before.&amp;nbsp;&amp;nbsp;DataRush now&amp;nbsp;supports &lt;a href="http://knime.org/" title="KNIME" target="_blank"&gt;KNIME&lt;/a&gt; 2.2.2, and is a recommended upgrade for all KNIME users.&amp;nbsp; &lt;/p&gt;
&lt;p&gt;If you&amp;#39;re new to DataRush, request a &lt;a href="http://pervasivedatarush.com/downloads/Pages/PervasiveDataRushDownloadRequest.aspx"&gt;free trial&lt;/a&gt; and see what DataRush can do for you.&amp;nbsp; &lt;/p&gt;&lt;img src="http://cs.pervasive.com/aggbug.aspx?PostID=45472" width="1" height="1"&gt;</description><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/DataRush/default.aspx">DataRush</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Pervasive/default.aspx">Pervasive</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/KNIME/default.aspx">KNIME</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Analytics/default.aspx">Analytics</category></item><item><title>Did anyone notice that a 48-machine cluster is now available on a single server?</title><link>http://cs.pervasive.com/blogs/datarush/archive/2010/10/01/did-anyone-notice-that-a-48-machine-cluster-is-now-available-on-a-single-server.aspx</link><pubDate>Fri, 01 Oct 2010 15:14:00 GMT</pubDate><guid isPermaLink="false">3741b99c-ad24-4023-9eca-ddf558b8b674:45437</guid><dc:creator>Richard Maddox</dc:creator><slash:comments>1</slash:comments><description>&lt;p&gt;&lt;span style="FONT-FAMILY:&amp;#39;Arial&amp;#39;,&amp;#39;sans-serif&amp;#39;;COLOR:black;FONT-SIZE:10pt;"&gt;&lt;em&gt;Did you notice the sea change taking place in big data computing?&lt;/em&gt; &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="FONT-FAMILY:&amp;#39;Arial&amp;#39;,&amp;#39;sans-serif&amp;#39;;COLOR:black;FONT-SIZE:10pt;"&gt;Multicore processors are redefining the compute density of a single server. When fully leveraged with the right software, multicore processors can provide aggressive parallelism to tackle big data challenges, outperforming much more expensive cluster computing. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="FONT-FAMILY:&amp;#39;Arial&amp;#39;,&amp;#39;sans-serif&amp;#39;;COLOR:black;FONT-SIZE:10pt;"&gt;Today, 4 AMD Opteron 6000 12-core processors in a single server can be more effective, green and cost friendly than relying on a cluster. A single-box solution removes cross-machine communication delays. With a cluster, you incur additional reliability overhead to accommodate the very real case that a single node goes down; but within a single multicore box, the probability of any single core going down is negligible. In a cluster, you incur overhead distributing data across a network; within a single multicore server, no such overhead is required. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="FONT-FAMILY:&amp;#39;Arial&amp;#39;,&amp;#39;sans-serif&amp;#39;;COLOR:black;FONT-SIZE:10pt;"&gt;&lt;em&gt;It all rests on the software you choose to build and deploy. Shouldn’t you make software design decisions based on where the industry is going – &lt;strong&gt;dense computing&lt;/strong&gt; – rather than where it&amp;#39;s been – &lt;strong&gt;distributed computing&lt;/strong&gt;?&lt;/em&gt;&lt;br /&gt;&amp;nbsp; &lt;br /&gt;&lt;strong&gt;On October 7, this coming Thursday,&lt;/strong&gt; Pervasive CTO Mike Hoskins and Pervasive DataRush Chief Technologist Jim Falgout will discuss how a single box utilizing Pervasive DataRush can deliver incredible performance while saving you the expense, overhead and maintenance of a cluster. &lt;a href="http://www.pervasivedatarush.com/Pages/ScaleUpBeforeJumpingintoHadoop.aspx"&gt;Register now&lt;/a&gt;.&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;img src="http://cs.pervasive.com/aggbug.aspx?PostID=45437" width="1" height="1"&gt;</description><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/AMD/default.aspx">AMD</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/clustering/default.aspx">clustering</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/big+data/default.aspx">big data</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/dense+computing/default.aspx">dense computing</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/distributed+computing/default.aspx">distributed computing</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/clusters/default.aspx">clusters</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/single+server/default.aspx">single server</category></item><item><title>Pervasive PSQL v11 MC Release Candidate - But Wait, There's More</title><link>http://cs.pervasive.com/blogs/pervasive_answerman/archive/2010/08/11/pervasive-psql-v11-mc-release-candidate-but-wait-there-s-more.aspx</link><pubDate>Wed, 11 Aug 2010 15:34:00 GMT</pubDate><guid isPermaLink="false">3741b99c-ad24-4023-9eca-ddf558b8b674:44910</guid><dc:creator>answerman</dc:creator><slash:comments>3</slash:comments><description>&lt;p&gt;Dateline: August 11, Austin&lt;/p&gt;&lt;p&gt;Pervasive PSQL v11 MC Release Candidate is here. If you haven’t already downloaded the beta (and even if you have), you should definitely &lt;a href="http://www.pervasivedb.com/Database/Trials/Pages/PSQLv11_RC.aspx"&gt;download PSQL v11 MC RC&lt;/a&gt; and give it a test.&amp;nbsp; What started two years ago as a fairly quiet, under the radar type product plan has turned into a full blown big list of features release.&lt;br /&gt;&lt;b&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;Part 1 - Multi-core&lt;/b&gt;&lt;/p&gt;&lt;p&gt;Raise your hand if you’ve bought a server in the last year. If you have, it’s got a multi-core processor. Minimum would be a dual core, more likely a quad, and lots of newer boxes are coming with 8-cores. Now, raise your hand if you’ve designed your application with a lot of parallel processes to run on those multiple cores. Most companies have upgraded hardware, most haven’t written their applications to take advantage of it.&amp;nbsp; Many applications are going to run more slowly as a result. (Managing multiple cores, cache and synchronizing everything creates a LOT of overhead.) For more on why this happens, &lt;a href="http://www.pervasivedb.com/Database/Documents/WP_The_Multi-core_Dilemma_20100723.pdf"&gt;check out the whitepaper from CITO Research - The Multi-core Dilemma.&lt;/a&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;The good news - Pervasive PSQL v11 has been redesigned to do a lot of things in parallel and is now optimized for multi-core hardware. For the details, check out the What’s New in Pervasive PSQL v11. Better yet download a trial. Or, you could start over with your current code, find which parts can be run in parallel, solve all of the issues related to synchronizing memory access, debugging, testing, etc. Probably only take a couple of years, if everything goes perfectly.&amp;nbsp; Trust me, the simplest thing to do is go with PSQL v11.&lt;br /&gt;&lt;b&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;Part 2 - IPv6&lt;/b&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;The end is near. Really. The current standard for Internet addresses, IPv4, is due to run out of addresses in less than a year. Check out &lt;a href="http://penrose.uk6x.com/"&gt;http://penrose.uk6x.com/&lt;/a&gt; for a countdown clock.&amp;nbsp; What’s next - IPv6 - which will be using 128-bit addresses (instead of 32-bit with IPv4) and should last for a ridiculously long time. IPv4 had 4 billion addresses (sounds like a lot - but with 6 billion people on the planet and lots of devices, blogs, sites, etc. we were eventually going to run out). &lt;br /&gt;&lt;/p&gt;&lt;p&gt;IPv6 has 340,282,366,920,938,000,000,000,000,000,000,000,000 - or roughly, 340 undecillion (look it up, I did) addresses.&amp;nbsp; That’s about 57 trillion billion addresses for each and every one of us.&amp;nbsp; We won’t run out any time soon. The issue for you is that governments (Japan and US) and other large institutions are requiring IPv6 compatibility.&amp;nbsp; More good news - PSQL v11 already supports IPv6.&amp;nbsp; So when the end arrives, you’ll be ready.&lt;br /&gt;&lt;b&gt;&lt;/b&gt;&lt;/p&gt;&lt;p&gt;&lt;b&gt;Parts 3 through N - PDAC for RAD Studio 2009 and 2010, ADO.NET 3.5, 64-bit ODBC driver, etc.&lt;/b&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;As usual, Pervasive Engineering has done a fabulous job of adding all kinds of additional features and updates. I start sounding like Ron Popiel (think Ronco, pocket fisherman, veg-o-matic, inventor, and the greatest TV pitchman ever) - but wait, there’s more.&amp;nbsp; Here’s what else you get with PSQL v11:&lt;br /&gt;-&amp;nbsp;&amp;nbsp; &amp;nbsp;64-bit ODBC driver&lt;br /&gt;-&amp;nbsp;&amp;nbsp; &amp;nbsp;PDAC for RAD Studio 2009 and 2010&lt;br /&gt;-&amp;nbsp;&amp;nbsp; &amp;nbsp;ADO.NET 3.2 and 3.5 with Entity Framework Support&lt;br /&gt;-&amp;nbsp;&amp;nbsp; &amp;nbsp;Telephone activation&lt;br /&gt;-&amp;nbsp;&amp;nbsp; &amp;nbsp;New releases of AuditMaster and DataExchange - and soon Backup Agent&lt;br /&gt;-&amp;nbsp;&amp;nbsp; &amp;nbsp;Backward compatible, simple upgrade&lt;br /&gt;&lt;/p&gt;&lt;p&gt;Pervasive PSQL v11 helps you skip over some really huge technical hurdles (multi-core and IPv6) and adds a boatload of cool extra features. To get all of the details, go to the &lt;a href="http://www.pervasivedb.com/Database/Trials/Pages/Default.aspx"&gt;trials section&lt;/a&gt; of the Pervasive website and/or check out the &lt;a href="http://www.pervasivedb.com/Database/Products/PSQLv10/Pages/PSQLv11FAQ.aspx"&gt;Pervasive PSQL v11 FAQ&lt;/a&gt;.&lt;br /&gt;&lt;/p&gt;&lt;p&gt;It’s awesome.&lt;br /&gt;&lt;br /&gt;&lt;/p&gt;&lt;p&gt;&amp;nbsp;&lt;/p&gt;&lt;img src="http://cs.pervasive.com/aggbug.aspx?PostID=44910" width="1" height="1"&gt;</description></item><item><title>Can You Define (Really) Big Data?</title><link>http://cs.pervasive.com/blogs/datarush/archive/2010/08/06/can-you-define-really-big-data.aspx</link><pubDate>Fri, 06 Aug 2010 21:04:00 GMT</pubDate><guid isPermaLink="false">3741b99c-ad24-4023-9eca-ddf558b8b674:44868</guid><dc:creator>livey</dc:creator><slash:comments>0</slash:comments><description>&lt;font size="3"&gt;&lt;font face="Calibri"&gt;&lt;a href="http://www.information-management.com/authors/1033156.html" target="_blank"&gt;Jim Ericson’s&lt;/a&gt; blog on &lt;i style="mso-bidi-font-style:normal;"&gt;(Really) Big Data&lt;/i&gt; perked our interests.&lt;span style="mso-spacerun:yes;"&gt;&amp;nbsp; &lt;/span&gt;In the industry of big data, we’re always curious to know what IS big data?&lt;span style="mso-spacerun:yes;"&gt;&amp;nbsp; &lt;/span&gt;It used to be terabytes, now it’s petabytes, and it’s exponentially growing.&lt;span style="mso-spacerun:yes;"&gt;&amp;nbsp; &lt;/span&gt;We agree with Jim, it’s not about how much data you have but how much valuable information you’re getting from it.&lt;span style="mso-spacerun:yes;"&gt;&amp;nbsp;&amp;nbsp; &lt;/span&gt;As cited by Davin Potts, our Director of Product Management, “Sipping from the firehose isn&amp;#39;t impressive. Big data is when I can scale to process the volume, keep up with the firehose, and afford to crunch through all of that data that I can get my hands on. Anything less is just a big number.”&lt;span style="mso-spacerun:yes;"&gt;&amp;nbsp; &lt;/span&gt;&lt;/font&gt;&lt;/font&gt;&lt;a href="http://www.information-management.com/blogs/big_data_hype_reality-10018470-1.html" target="_blank"&gt;&lt;font size="3" face="Calibri"&gt;Read more&lt;/font&gt;&lt;/a&gt;&lt;font size="3"&gt;&lt;font face="Calibri"&gt;.&lt;/font&gt;&lt;/font&gt;&lt;img src="http://cs.pervasive.com/aggbug.aspx?PostID=44868" width="1" height="1"&gt;</description><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/big+data/default.aspx">big data</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Jim+Ericson/default.aspx">Jim Ericson</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/valuable+information/default.aspx">valuable information</category></item><item><title>Pervasive DataCloud2 creates success for Amazon Web Services, Microsoft Azure and Force.com!</title><link>http://cs.pervasive.com/blogs/talking_out_cloud/archive/2010/07/12/pervasive-datacloud2-creates-success-for-amazon-web-services-microsoft-azure-and-force-com.aspx</link><pubDate>Mon, 12 Jul 2010 14:04:00 GMT</pubDate><guid isPermaLink="false">3741b99c-ad24-4023-9eca-ddf558b8b674:44580</guid><dc:creator>bjacaruso</dc:creator><slash:comments>1</slash:comments><description>&lt;p&gt;Reuven Cohen ( &lt;a href="http://bjac.it/brPe6V"&gt;http://bjac.it/brPe6V&lt;/a&gt; ) described this morning in a couple of paragraphs ( &lt;a href="http://bjac.it/aSDu5x"&gt;http://bjac.it/aSDu5x&lt;/a&gt; ) what I have been trying to explain for the last 18 months.&amp;nbsp; The Pervasive DataCloud2 is not a competitor to other clouds or Platforms as a Service, but a complementary service.&lt;/p&gt;
&lt;p&gt;I would go a step further and say that the boutique cloud services are critical for the continued success for the likes of Amazon, Microsoft and force.com.&amp;nbsp; I am sure that the guys in Seattle (both groups) and the teams in the valley ( &lt;a href="http://bjac.it/aJqxjd"&gt;http://bjac.it/aJqxjd&lt;/a&gt; ) would like to tell you that they can be everything to everyone.&amp;nbsp; But they know as well as we do that is not part of their strategy.&amp;nbsp; Sure, they will try to offer as much as they can in terms of variety of service, but in the end compromises will be made for the greater good of all.&lt;/p&gt;
&lt;p&gt;Pervasive DataCloud2 and services like it add pieces ( &lt;a href="http://bit.ly/j4n2U"&gt;http://bit.ly/j4n2U&lt;/a&gt; )&amp;nbsp; that these vendors cannot and probably should not offer as part of their platform.&amp;nbsp; Can a guy like Mark Benioff really justify spending time helping a company integrate their Netsuite with an on premise SAP implementation?&amp;nbsp; Hosting the service infrastructure makes perfect sense for Amazon, but should they develop the integration expertise?&amp;nbsp; Should Steve Ballmer and the crew in Redmond really be in the business of analyzing an organization’s healthcare data?&lt;/p&gt;
&lt;p&gt;I say no to all of the above, and even though they will not say it publicly, I am guessing the above list of cloud vendors know that it is a truth and fact that niche cloud services guys are a key component of their success.&amp;nbsp; You are welcome guys!&lt;br /&gt;&lt;/p&gt;&lt;img src="http://cs.pervasive.com/aggbug.aspx?PostID=44580" width="1" height="1"&gt;</description></item><item><title>Pervasive DataRush: Cost-effective security for companies in a challenging economic climate</title><link>http://cs.pervasive.com/blogs/datarush/archive/2010/07/07/pervasive-datarush-cost-effective-security-for-companies-in-a-challenging-economic-climate.aspx</link><pubDate>Wed, 07 Jul 2010 22:00:00 GMT</pubDate><guid isPermaLink="false">3741b99c-ad24-4023-9eca-ddf558b8b674:44543</guid><dc:creator>Richard Maddox</dc:creator><slash:comments>1</slash:comments><description>&lt;p&gt;Security spending in a downturn is under tight scrutiny. PricewaterhouseCoopers found this to be the case when it surveyed 7,200 executives in over 130 countries for its 2010 “Trial by Fire” report. One of the report’s primary findings states:&amp;nbsp; &lt;/p&gt;
&lt;p&gt;Not surprisingly, security spending is under pressure. Most executives are eyeing strategies to cancel, defer or downsize security-related initiatives. &lt;br /&gt;Source: &lt;a title="PricewaterhouseCoopers 2010 “Trial by Fire” " href="http://www.pwc.com/en_US/us/it-risk-security/assets/trial-by-fire.pdf"&gt;PricewaterhouseCoopers 2010 “Trial by Fire”&lt;/a&gt; report. &lt;/p&gt;
&lt;p&gt;PricewatershouseCoopers notes that 70% of survey respondents think it is important to consider canceling, deferring or downsizing security-related initiatives if they require capital expenditures while 71% respond similarly for initiatives requiring working expenditures.At the same time, survey respondents overwhelmingly said they considered security strategies to be important, including increasing the focus on data protection, prioritizing security investments on risk, reducing or mitigating major risks and accelerating the adoption of security-related automation technologies to increase efficiencies and reduce cost. With many analysts predicting a slow recovery, the pressure to cut or curb security spending could increase. Meanwhile, security intrusions continue, likely at a more sophisticated scale (consider &lt;a title="&amp;quot;Ghostnet&amp;quot;" href="http://en.wikipedia.org/wiki/GhostNet"&gt;“Ghostnet” &lt;/a&gt;for example). &lt;/p&gt;
&lt;p&gt;So, are there any cost-effective solutions available to meet growing security? Yes.The massively parallel-processing horsepower of the Pervasive DataRush™ data processing engine combined with our matching capabilities forms a powerful solution for translating massive amounts of raw data into actionable intelligence. Combined with Pervasive DataMatcher™, a cost-effective robust, innovative solution is available to financial, insurance, healthcare, law enforcement and homeland security organizations that want to leverage powerful, next-generation analytics for detecting fraud and corruption, complying with anti-money laundering controls, and security and compliance monitoring. &lt;br /&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;The Proof is in the Pudding: MalStone B-10 Benchmark&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;This year Pervasive DataRush conducted internal testing using the MalStone B. The benchmark examines large volumes of logfiles to look for anomalies that might signal intrusions or attempted intrusions.&amp;nbsp;&amp;nbsp; &lt;/p&gt;
&lt;p&gt;The data file for MalStoneB is generated by a Python script and the MalStone records have the format: &lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Event ID | Timestamp | Site ID | Compromise Flag | Entity ID&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;These describe a visit by an entity to a site at a particular time. After the visit, the entity sometimes becomes compromised, which is indicated by setting the compromise flag to 1. Each record is 100 bytes.&amp;nbsp; MalStoneB computes a ratio for each week d, and computes for each site w, and for all entities that visited the site at week d or earlier, the percent of visits for which the entity became compromised at any time between the visit and the end of the week d.The Malstone benchmark can use a variable sized dataset, In the experiment a 10 billion row dataset totaling 1 terabyte was used. &lt;br /&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Summary of Details:&lt;/strong&gt; &lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;32 core, 4 socket, 2.0 Ghz Intel Xeon X7550&lt;/li&gt;
&lt;li&gt;1890 seconds (32.5 minutes)&lt;/li&gt;
&lt;li&gt;5.29 million rows/sec, approx 509 Mbytes/sec&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;
&lt;p&gt;&lt;strong&gt;The Result: In just 31.5 minutes, 10 billion records were searched for anomalies using Pervasive DataRush. This result is 26x faster than its competition.&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Pervasive DataRush can enable applications that scale seamlessly on a single multicore server (rather than a cluster) to prepare or analyze even massive datasets – at unprecedented speeds.&amp;nbsp; Cost-effective approaches to daunting security challenges – that’s what IT executives seek in the midst of a lingering downturn, and that’s what we want to give them.&amp;nbsp; &lt;/p&gt;&lt;img src="http://cs.pervasive.com/aggbug.aspx?PostID=44543" width="1" height="1"&gt;</description><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Terrorism/default.aspx">Terrorism</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Intelligence+Communities/default.aspx">Intelligence Communities</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/DataRush/default.aspx">DataRush</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/cyber+security/default.aspx">cyber security</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/MalStone+B/default.aspx">MalStone B</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/2+terabytes+an+hour/default.aspx">2 terabytes an hour</category></item><item><title>Distributed, Scalable Clustering for Detecting Halos in Terascale Astronomical Datasets.</title><link>http://cs.pervasive.com/blogs/datarush/archive/2010/07/02/distributed-scalable-clustering-for-detecting-halos-in-terascale-astronomical-datasets.aspx</link><pubDate>Fri, 02 Jul 2010 14:47:00 GMT</pubDate><guid isPermaLink="false">3741b99c-ad24-4023-9eca-ddf558b8b674:44518</guid><dc:creator>n5712036</dc:creator><slash:comments>1</slash:comments><description>&lt;p&gt;&lt;br /&gt;The process of &lt;a title="N-Body Shop" href="http://www.tacc.utexas.edu/research/users/features/quinn.php" target="_blank"&gt;stellar discovery&lt;/a&gt; has long made its home at High Performance Computing (HPC) systems. &lt;br /&gt;&lt;a title="HPC" href="http://en.wikipedia.org/wiki/High-performance_computing" target="_blank"&gt;HPC&lt;/a&gt; systems have evolved into clusters of &amp;quot;fat&amp;quot; multicore nodes. Applications must take advantage &lt;br /&gt;of parallelism across nodes and at the node level to maximize scalability and performance/watt. &lt;br /&gt;The complexity of multicore programming underscores the need for powerful and efficient runtime &lt;br /&gt;systems that manage resources such as cores, threads, memory, and communication sub-systems on behalf &lt;br /&gt;of the application. Dataflow is the computational model in &lt;a title="Datarush" href="http://cs.pervasive.com/controlpanel/blogs/www.pervasivedatarush.com" target="_blank"&gt;Pervasive Datarush&lt;/a&gt; to construct &lt;br /&gt;efficient data-parallel pipelines via threads while abstracting the complexity of multicore programming. &lt;/p&gt;
&lt;p&gt;Simulation of the dynamical evolution of the entire observable universe via &lt;a title="N-Body" href="http://en.wikipedia.org/wiki/N-body_simulation" target="_blank"&gt;N-Body&lt;/a&gt; interactions &lt;br /&gt;begins with the presumed first principles of the universe: cosmic background radiation; an expanding &lt;br /&gt;volume of cooling helium and hydrogen; dark matter separating from gas and coalescing into massive &lt;br /&gt;stars. The classical N-body problem simulates the evolution of a system of N bodies, where the force &lt;br /&gt;exerted on each body arises due to its interaction with all the other bodies in the system. N-body &lt;br /&gt;algorithms have numerous applications in areas such as astrophysics, molecular dynamics and plasma &lt;br /&gt;physics. The Cube3PM method for carrying out large N-Body simulations to study formation and evolution &lt;br /&gt;of the large scale structure in the universe combines direct particle-particle forces at small scales &lt;br /&gt;with particle-mesh ones at larger scales (Particle-Particle-Particle-Mesh Method). Such an approach &lt;br /&gt;produces datasets with 4000^3-5488^3 (64-165 billion) particles. Several such simulations were &lt;br /&gt;completed on Ranger Cluster (&lt;a title="Teragrid" href="http://www.tacc.utexas.edu/research/users/features/teragrid.php" target="_blank"&gt;Texas Advanced Computing Center&lt;/a&gt;) on 4,000-22,976 cores.&lt;/p&gt;
&lt;p&gt;From the astrophysicist’s perspective, the problem of identifying regions of interest in this terascale &lt;br /&gt;data and being able to visualize these regions is made intractable by the overwhelming volumes of data.&lt;br /&gt;Current methods to detect individual halos fall into two basic categories, namely friends-of-friends (FOF)&lt;br /&gt;and spherical over density (SO) methods. The FOF method is particle-based. Dense regions are identified &lt;br /&gt;by locating particles that are closer to each other than a pre-defined distance, which is a parameter of &lt;br /&gt;the model and is usually referred to as &amp;#39;linking length&amp;#39; . Particles that are within that distance from &lt;br /&gt;each other are called &amp;#39;friends&amp;#39;, and the halos produced consist of all particles which are connected by &lt;br /&gt;a chain of friends. The SO class of methods, on the other hand, start by identifying the local density &lt;br /&gt;peaks (or gravitational potential minima) as the halo centers and then expand spherical shells around &lt;br /&gt;those centers until a pre-defined density threshold (a free parameter of the model picked based on &lt;br /&gt;dynamical considerations) is crossed. Within these types of methods there are multiple variations, &lt;br /&gt;regarding e.g. how the halo centers are located, how the gravitationally-unbound particles are treated, &lt;br /&gt;etc. Each of the two basic approaches, FOF and SO, has its advantages and drawbacks and can fail in &lt;br /&gt;certain situations (&lt;a title="Tinker et. al" href="http://iopscience.iop.org/0004-637X/688/2/709" target="_blank"&gt;Tinker et al&lt;/a&gt;.). &lt;/p&gt;
&lt;p&gt;&amp;quot;Automated methods for halo identification and visualization are critical to advancing the physical &lt;br /&gt;understanding of what is happening through better analysis&amp;quot;, said Astronomy Centre at the&lt;br /&gt;University of Sussex, UK.&lt;/p&gt;
&lt;p&gt;Our dataflow methods supply an alternative to the current approaches which on one hand is &lt;br /&gt;density-based like the SO, but does not make assumptions about the halo shapes as the SO does.&lt;/p&gt;
&lt;p&gt;This dataflow implementation distributes itself across multiple nodes on the Longhorn cluster. &lt;br /&gt;Likewise, this parallelized dataflow AutoHDS facilitates the use of large number of cores on a &lt;br /&gt;single cheap machine instead of expensive super computers. Experiments revealed that when data &lt;br /&gt;points were uniformly distributed across partitions, dataflow AutoHDS achieved linear speed up &lt;br /&gt;with the increase in the number of machines used. Dataflow AutoHDS also yields better performance &lt;br /&gt;with increasing data volumes.&amp;nbsp; In comparisons against Hadoop AutoHDS, dataflow was consistently &lt;br /&gt;faster on fewer resources.&lt;/p&gt;
&lt;p&gt;This work has been submitted for publication.&amp;nbsp; Coming here soon....&lt;br /&gt;&lt;/p&gt;&lt;img src="http://cs.pervasive.com/aggbug.aspx?PostID=44518" width="1" height="1"&gt;</description><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/DataRush/default.aspx">DataRush</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/HPC/default.aspx">HPC</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/dataflow/default.aspx">dataflow</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/Nena+Marin/default.aspx">Nena Marin</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/terascale/default.aspx">terascale</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/density+based+clustering/default.aspx">density based clustering</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/astronomical+dataset/default.aspx">astronomical dataset</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/distributed+clustering/default.aspx">distributed clustering</category><category domain="http://cs.pervasive.com/blogs/datarush/archive/tags/stellar+discovery/default.aspx">stellar discovery</category></item></channel></rss>