Kitenga - Enabling Insight Through Analytics
Kitenga Analytics Suite
Enabling insight through analytics
Written by Joey Jablonski, Director of Product Management, Kitenga Analytics, Dell, Inc.
Abstract
Organizations today have a lot of data. To make the most
of it, they need the right tools—tools that can be deployed
rapidly and that make it easy for both IT staff and less technical
employees to turn that data into information meaningful to
the business. This tech brief explores analytics and introduces
several powerful tools that make it easy for you to rapidly
deploy a powerful analytical environment in your organization.
Introduction
Today, companies devote ever greater resources to
ensuring they can respond to changing market conditions,
customer preferences and competitive threats in a timely
manner. Today’s analytic technologies enable companies to
respond to these changes faster and more effectively.
Analytics is about transforming data into meaningful information that
decision makers can use to make appropriate changes to
how they operate the business and respond to the market.
Analytics is a collection of technologies that align with business
needs and enable deep visibility into corporate data assets,
while allowing iterative feedback to re-create reports based on
needs, questions and market conditions.
Kitenga Analytics Suite from Dell Software is an advanced
analytics toolkit that brings together natural language
processing, machine learning, search and advanced
visualization, connected by an integrated workbench meant for
non-developer staff. Kitenga is a business-enabling technology
that gives big data access to more of the staff who are closest to
the business and the decision-making process.
How analytics can help organizations like yours
Exactly how can analytics benefit organizations today? Let’s
look at two examples of how analytics can help organizations
improve their operations and their bottom lines.
Healthcare organizations can provide
better care by analyzing patient data.
Every time a patient arrives at a
medical facility, an intake assessment is
completed. This assessment includes a
variety of standard questions, such as the
reason for the visit, symptoms, previous
visits and current medications. This
assessment enables the medical provider
to quickly determine the severity level
and the appropriate place within the
facility to route the patient.
With the right tools, though, intake
assessments can be even more useful.
By automatically comparing the current
patient to previous patient populations
and discerning patterns, the healthcare
facility could improve both patient care and
the organization’s bottom line.
Kitenga enables these proactive, advanced
analytics. It automatically analyzes patient
data and provides recommendations
to medical staff based on patterns that
may not be obvious at the time of initial
consultation with the patient.
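The idea of comparing a new patient to previous patient populations can be sketched in a few lines. The example below is a minimal illustration only, not Kitenga's method or API: the records, field names and the choice of Jaccard overlap as the similarity measure are all assumptions made for the sake of the example.

```python
# Illustrative sketch: hypothetical intake records, not Kitenga's API.

def jaccard(a, b):
    """Share of symptoms two intake records have in common (0.0 to 1.0)."""
    a, b = set(a), set(b)
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def similar_patients(current, history, top_n=2):
    """Rank previous intake records by symptom overlap with the current one."""
    scored = [
        (jaccard(current["symptoms"], past["symptoms"]), past)
        for past in history
    ]
    # Drop records with no symptoms in common, then sort by score only.
    scored = [(score, past) for score, past in scored if score > 0]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:top_n]


# Made-up data for demonstration.
history = [
    {"id": "p1", "symptoms": ["fever", "cough", "fatigue"], "outcome": "admitted"},
    {"id": "p2", "symptoms": ["sprained ankle"], "outcome": "discharged"},
    {"id": "p3", "symptoms": ["fever", "cough"], "outcome": "discharged"},
]
current = {"id": "new", "symptoms": ["fever", "cough", "headache"]}

matches = similar_patients(current, history)
for score, past in matches:
    print(f"{past['id']}: overlap {score:.2f}, outcome {past['outcome']}")
```

A production system would weigh far more than symptom overlap (medications, history, demographics), but the shape is the same: score the current record against past records and surface the closest ones to medical staff.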
Financial services organizations can
better assess transactions and risk by
performing contract valuation.
Companies that buy and sell baskets
of contracts (such as mortgages and
insurance policies) must complete a
due diligence process to review those
contracts for errors and discrepancies,
and to price and model the risk
associated with them. The pricing
and financial default models for these
baskets of contracts are often assumed
but not verified, despite the risk of these
complex transactions.
Kitenga can compare both structured
and unstructured data, enabling
organizations to review large groups
of contracts and access external
information (such as credit scores,
payment history and banking
information) for validation when
assessing a complex transaction.
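As a rough illustration of comparing unstructured and structured data, the sketch below extracts two terms from free-form contract text and checks them against a structured record. It is a simplified assumption of how such a check could look, not a description of Kitenga's implementation; the contract wording, the regular expressions and the field names are all hypothetical.

```python
import re

# Illustrative sketch: the contract text, patterns and record are made up.
AMOUNT = re.compile(r"principal amount of \$([\d,]+)")
BORROWER = re.compile(r"Borrower:\s*(.+)")


def extract_terms(contract_text):
    """Pull the borrower name and principal out of free-form contract text."""
    amount = AMOUNT.search(contract_text)
    borrower = BORROWER.search(contract_text)
    return {
        "borrower": borrower.group(1).strip() if borrower else None,
        "principal": int(amount.group(1).replace(",", "")) if amount else None,
    }


def find_discrepancies(contract_text, record):
    """List fields where the contract text disagrees with the structured record."""
    terms = extract_terms(contract_text)
    return [
        field for field, value in terms.items()
        if value != record.get(field)
    ]


contract = """Borrower: Jane Example
The lender agrees to advance a principal amount of $250,000 to the Borrower."""

# Structured record with a transposed principal, as a deliberate discrepancy.
record = {"borrower": "Jane Example", "principal": 205000}

issues = find_discrepancies(contract, record)
print(issues)
```

Run across a whole basket of contracts, a check like this narrows attention to the documents whose text and records disagree.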
Kitenga Analytics Suite delivers the
advanced analytics organizations
need today.
Today’s business problems require the
integrated analysis of a multitude of data
types, from a variety of sources. Kitenga
enables users to analyze, compare and
report on unstructured, semi-structured
and structured data from the same tool,
using the same algorithms, without
accounting for the data type or source.
This single source enables analysts to
quickly find meaning in complex data
sets and make decisions about the best
course for the business.
Core technologies
Kitenga contains four core technologies,
integrated for use from a single
workbench and adaptable to the workflow
the data analyst requires:
• Natural language processing (NLP)—
Kitenga can extract meaning from raw
text, including information about people
and places, as well as relationships
between entities.
• Machine learning (ML)—Kitenga has the
ability to “learn” from a given data set
and be trained over time to increase
the accuracy of sentiment grouping,
relationships and visualizations.
Figure 1. Dell’s solutions enable business users—not just IT staff—to access, analyze
and understand data.
Figure 1 labels: Integration (Boomi / Toad), Curation (Hadoop), Understanding (Kitenga).
• Visualization—Kitenga has the ability to
visualize the data that was extracted during
the NLP and ML phases of data analysis.
• Search—Search has become a powerful
enabling technology for accessing large,
complex information sets. Kitenga’s
advanced search capabilities let users
mine large volumes of data, returning not
only results that match the search query
but also related data that users may not
have initially thought to search for.
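The search behavior described in the list above, returning related data alongside direct matches, can be approximated with a simple co-occurrence count. The following is a toy sketch under stated assumptions: the documents are invented, and counting terms that appear alongside the query is only one possible way to suggest related material, not necessarily the approach Kitenga takes.

```python
from collections import Counter

# Illustrative sketch: a tiny made-up corpus, one string of terms per document.
docs = {
    "d1": "hadoop cluster storage",
    "d2": "hadoop analytics search",
    "d3": "search visualization analytics",
}


def search(query, docs):
    """Return the ids of documents that contain the query term, sorted."""
    return sorted(doc_id for doc_id, text in docs.items() if query in text.split())


def related_terms(query, docs, top_n=2):
    """Suggest terms that most often appear in the same documents as the query."""
    counts = Counter()
    for doc_id in search(query, docs):
        counts.update(term for term in docs[doc_id].split() if term != query)
    # Most frequent first; ties broken alphabetically for stable output.
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [term for term, _ in ranked[:top_n]]


hits = search("analytics", docs)
suggestions = related_terms("analytics", docs)
print(hits, suggestions)
```

Here a search for "analytics" also surfaces "search" and "hadoop" as related terms the user did not ask for directly.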
Kitenga enables non-technical staff
members to analyze complex data sets.
Kitenga’s intuitive interface enables a
broad range of staff to interact with and
analyze complex data sets that were
previously limited to software developers
and IT departments. Some common
users of Kitenga include:
• Medical researcher—Medical researchers
commonly review complex data sets to
identify patterns that were not previously
known so that processes can be modified
to improve patient care.
• Business analyst—Business analysts can
create advanced analysis jobs and view
the results in a visualization interface. Kitenga
makes it easy for them to take an iterative
approach, refining the data as understanding
increases and data volumes grow.
• Data scientist—Data scientists can create
reusable analytics modules, called
extractors, to execute algorithms and
iterate over the results using Kitenga’s self-
service portal.
• Product manager—Product managers can
pull together separate data sets into a single
analysis to compare product information,
sentiment and market conditions to plan
new products, consider changes and
anticipate market conditions.
• Legal researcher—Legal researchers can
identify relationships across large, complex
data sets and narrow the set of documents
that must be manually reviewed by staff,
achieving a higher success rate than fully manual
document reviews.
Figure 2. Kitenga combines four core technologies, integrated for use from a
single workbench.
Figure 2 labels: Search, Visualization, Machine learning and Natural language processing, arranged around Kitenga™.
Dell delivers complete analytics
solutions, as well as components
that integrate with your existing
environment.
Analytics requires not only data analysis
but effective presentation and interaction.
Analytics is more than just a single
application; it is a presentation and
interaction layer that empowers users,
powered by a strong data integration
layer and infrastructure. All components
must be rapid to deploy and must
operate together to provide flexibility as
business needs change.
With Dell, you can rapidly deploy a
powerful analytical environment.
Dell offers complete analytical solutions,
as well as components for integration
with existing environments to ensure
maximum flexibility when deploying
analytical environments (see Figure 3).
We have already reviewed the key
capabilities of Kitenga. Let’s look at
some of the other Dell solutions that
help organizations perform effective
data analysis.
Toad for Hadoop
In data analytics projects, much of the
time is often spent on data integration,
that is, simply getting data into and out of
Hadoop. Toad for Hadoop enables
staff to quickly develop processes to
ingest data into Hadoop and format
it in a way that advanced tools,
including Kitenga, can analyze and
visualize. Toad for Hadoop provides
a diverse set of connectors between a
Hadoop Distributed File System (HDFS)
environment and traditional databases,
including MySQL, Microsoft SQL Server
and DB2, along with a growing list of
other data sources and targets.
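To make the integration step concrete, the sketch below exports a database table to delimited text, a common interchange format for loading data into HDFS. It does not use or represent Toad for Hadoop: SQLite stands in for a traditional database, an in-memory buffer stands in for the file that would be copied into Hadoop, and the table and function names are hypothetical.

```python
import csv
import io
import sqlite3

# Illustrative sketch: SQLite is a stand-in for MySQL, SQL Server or DB2.


def export_table_as_csv(conn, table, out):
    """Write every row of `table` to `out` as CSV, header first; return row count."""
    # The table name is interpolated directly, so pass trusted names only.
    cursor = conn.execute(f"SELECT * FROM {table}")
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(column[0] for column in cursor.description)
    count = 0
    for row in cursor:
        writer.writerow(row)
        count += 1
    return count


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER, customer TEXT, total REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [(1, "acme", 19.5), (2, "globex", 42.0)],
)

buffer = io.StringIO()  # stands in for a file destined for HDFS
rows_written = export_table_as_csv(conn, "orders", buffer)
print(buffer.getvalue())
```

In practice the output would be written to a file and moved into the cluster; a connector-based tool automates this step, along with formatting and scheduling.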
SharePlex
SharePlex is a high-performance
solution for connecting Oracle database
environments to HDFS to enable bidirectional
data movement. SharePlex
enables administrators to replicate Oracle
redo logs into a Hadoop environment for
advanced analysis, recovery and efficient
data storage and archiving.
Dell | Cloudera Solution for
Apache Hadoop
The Dell | Cloudera Solution, shown
in Figure 4, delivers all the hardware,
software, resources and services
needed to run Hadoop in a production
environment. This end-to-end solution
enables you to be in production with
Hadoop in a shorter time than is typically
possible with homegrown solutions. By
listening to its customers, Dell designed
a Hadoop solution that is unique in
the marketplace, combining optimized
hardware, software and services to
streamline deployment and improve
customer experience.
Figure 3. Dell provides the components you need to create a powerful
analytics environment.
Figure 3 layers: Advanced Analytics & Visualization (Kitenga); Business Intelligence (Toad Intelligence Central); Data Integration (Toad for Hadoop, SharePlex); Structured / Unstructured (Cloudera).
The Dell | Cloudera Solution is based on
the Cloudera Enterprise distribution of
Hadoop, including CDH4. Cloudera has
created a quality-controlled distribution
of Hadoop and offers commercial
management software, updates, support
and consulting services.
The server platform for the Dell
| Cloudera Solution is the Dell
PowerEdge™ C or R series. Dell
PowerEdge servers are focused on
hyperscale and cloud capabilities.
Rather than emphasizing gigahertz
and gigabytes, these servers deliver
maximum density, memory and
serviceability while minimizing total cost
of ownership.
Dell’s solution includes components
that span the entire solution stack:
• Reference architecture and best practices
• Optimized server configurations for
PowerEdge R and PowerEdge C Series
• Optimized network infrastructure based on
Dell Force10 S4810 and S60 devices
• Dell Crowbar software framework for
deployment and management at scale
• Cloudera CDH Enterprise software
• Hadoop infrastructure management tools
• Monitoring with Ganglia and Nagios
Conclusion
Analytics turns data into insight that
empowers organizations to make better
decisions. Dell delivers unsurpassed
visibility into complex data sets, enabling
you to identify patterns and discover
relationships, and then iteratively review
and understand the data. With Dell’s
unique analytic capabilities, you can build
an environment with a single analytic
tool chain across your unstructured,
semi-structured and structured data,
giving staff across the organization the
information they need to improve the
organization’s bottom line.
To learn more about Dell’s big
data solutions, contact your Dell
representative or visit www.quest.com/kitenga-analytics-suite/.
Figure 4. The Dell | Cloudera Solution