Pervasive DataRush

This blog is syndicated from the Pervasive DataRush site.

January 2011 - Posts

  • Pervasive DataRush, Parallelism, Big Data and Hadoop are Top of Mind in Upcoming Pervasive Presentations

    January, February and March will be busy months for our technology evangelists. Pervasive DataRush Director of Product Management Davin Potts, Pervasive DataRush Chief Technologist Jim Falgout and Pervasive Software Chief Technology Officer Mike Hoskins will be speaking at major conferences over the next few weeks.

    At the O’Reilly Strata Conference (www.strataconf.com/strata2011) in San Francisco, running from February 1-3, Davin’’s “Supercharge Development and Performance of Hadoop Applications” presentation will take place on February 2 at 2:30 p.m. PST in the Mission City B4 room. He will discuss how to get more done faster with high-performance MapReduce and expand the universe of Hadoop possibilities with tools to speed and simplify development and deployment of analytic applications, such as Pervasive DataRush™. Pervasive Software also is exhibiting at the Strata event—you can find us at booth 501.

    For those in the Austin area, on January 29, Davin also will be speaking at DataDay Austin (www.datadayaustin.eventbrite.com). He will discuss “Reducing Complexity and Increasing Efficiency in the Land of Hadoop.” We hope you can join him.

    Pervasive Chief Technologist Jim Falgout will be addressing the European Data Innovation Summit (www.europeanintegrationsummit2011.com) audience in London at the Hilton London Tower Bridge on February 2. Jim’s presentation will be “Best Practices in Building Custom Applications for Big Data.” He will describe how the dataflow-based Pervasive DataRush framework delivers massive built-in parallelism to power applications that automatically scale up to consume the full capacity of commodity multicore servers.

    Jim also will be speaking at the KNIME User Group Meeting (www.knime.org/about/events/ugm-workshop-2011), being held in Zurich on February 28-March 4. Stay tuned for more details.

    At the GigaOM Structure Big Data conference (http://event.gigaom.com/bigdata/) on March 23 at Pier Sixty in New York City, Mike Hoskins will be part of the Hadoop panel that will be gathering at 1:30 p.m. Mike will be discussing parallelism and Big Data. Mike is always a font of industry knowledge and an astute trend predictor (and entertaining to boot).

    We look forward to seeing you—no matter the time zone--in the weeks ahead!
  • Intel’s Concurrency Checker confirms powerful scalability of Pervasive DataRush & Pervasive DataMatcher

    The Intel Concurrency Checker can be used to evaluate the performance scaling of applications on multi-core systems and to help further optimize applications. It’s a tool that is used to check application threading and threading concurrency and can also be utilized to measure performance by running the application before and after making specific code enhancements and comparing the measured results.

    Computed Scaling
    Computed scaling, or concurrency level, is Intel’s measure to predict the performance improvement factor when an application is run on a multi-core system, compared to a single-core system. The concurrency level is measured over a 30-second interval.  Pervasive Software recently used the Concurrency Checker to conduct software performance testing of Pervasive DataRush v4.4 running the MalStone B benchmark and Pervasive DataMatcher 5.0.

    Pervasive DataRush 4.4 running the MalStone B Benchmark
    MalStone benchmarks, developed by the Open Cloud Consortium, provide a method for assessing data-intensive application performance for cloud-based clusters. MalStone datasets consist of information about web site visits and cyber infection status. The benchmarks calculate the rate of infection for each site (an anomaly that might signal intrusions or attempted intrusions). 

    Largely a ‘Read’ operation, the result shows Pervasive DataRush’s ability to tackle I/O intensive data activities with phenomenal throughput.

    System Information:
    Cores: 16
    Processor:  164 Family 6 Model 15 Stepping 11 Intel Xeon CPU E7330 at 2.4GHz
    Operating System:  Windows Server 2008 R2 Standard Edition (build 7600), 64-bit
    Sockets: 4
    Logicals: 16
     
    Pervasive DataMatcher 5.0
    On the same 16-core system, Pervasive DataMatcher 5.0’s computed measured value was amazing.

    The test serves as another proof point about CPU- and process-intensive capabilities of Pervasive DataMatcher, which allows users to fully utilize all of the capacity of their multicore systems.

    System Information:
    Cores: 16
    Processor:  164 Family 6 Model 15 Stepping 11 Intel Xeon CPU E7330 at 2.4GHz
    Operating System:  Windows Server 2008 R2 Standard Edition (build 7600), 64-bit
    Sockets: 4
    Logicals: 16

    Intel Case Study

    Pervasive’s testing results further validate the computed scaling performance of our highly robust Pervasive DataRush-based products. Candidly (and no pun intended), our results were off the scale. The results, even more, serve as additional substantiation that the multicore reality is here and now.

    As a developer or other professional seeking software capable of tackling big data challenges, we encourage you to check out Pervasive DataRush yourself. The Pervasive DataRush team also invites you to read Intel’s case study of our use of the Concurrency Checker. Look for the case study soon. 


    *Based on 30-second elapsed time.

More Posts