Solutions Review’s annual Vendors to Know in Data Management Solutions provides the details on the most critical solution providers in the space.
The editors at Solutions Review continually research the most prominent and influential data management vendors to assist buyers in search of the tools befitting the needs of their organization. Choosing the right vendor and solution can be a complicated process; it requires constant market research and often comes down to more than just the solution and its technical capabilities. To make your search a little easier, we listed the vendors to know in data management software.
Note: Companies are listed in alphabetical order.
Vendors to Know in Data Management Software, 2021
Description: 1010data provides integrated capabilities for database management and data analytics. The company’s flagship product, 1010edge, also features data modeling and visualization, reporting, and application development. 1010 brings disparate data together to provide a granular view, and the solution scales to any size. In addition, the tool’s columnar data storage capabilities present data in an orderly fashion.
Amazon Web Services
Description: Amazon Web Services (AWS) offers Amazon Redshift, a fully managed, petabyte-scale data warehouse that analyzes data using an organization’s existing analytic software. Redshift’s data warehouse architecture allows users to automate common administrative tasks associated with provisioning, configuring, and monitoring cloud data warehousing. Backups to Amazon S3 are continuous, incremental and automatic. Redshift also includes Redshift Spectrum, allowing users to directly run SQL queries against large volumes of unstructured data without transforming.
Ataccama ONE is a comprehensive master data management product that offers an intriguing list of capabilities for many use cases. The solution offers a machine learning-centric user interface, as well as a data processing engine that is responsible for data transformations, evaluating business rules, and matching and merging rules. The platform supports any data, domain, and a variety of integrations.
Cloudera provides a data storage and processing platform based on the Apache Hadoop ecosystem, as well as a proprietary system and data management tools for design, deployment, operations and production management. Cloudera acquired Hortonworks in October 2018. It followed that up with a buy of San Mateo-based big data analytics provider Arcadia Data last September. Cloudera’s new integrated data management product (Cloudera Data Platform) enables analytics across hybrid and multi-cloud.
Commvault is well-known in the backup and disaster recovery marketplace, performing as one of the top solution providers. The company also offers a cloud data management product that allows organizations to manage data via on-prem and cloud deployments. Users can fully manage data across files, applications, databases, hypervisors, and clouds (including Amazon Web Services, Microsoft Azure, Google Cloud, and Oracle Cloud). The tool also includes Commvault’s popular backup and disaster recovery, as well as e-discovery, capabilities.
DataStax offers a distributed hybrid cloud database built on Apache Cassandra. The company’s flagship product is DataStax Enterprise, a solution that makes it easy for enterprise to exploit hybrid and multi-cloud environments via a data layer that eliminates complexity associated with deploying applications across multiple on-prem data centers or multiple public clouds. Its enterprise data layer eliminates data silos and cloud vendor lock-in and powers mission-critical applications.
Druva Phoenix offers data availability and governance functionality for virtual machines and physical servers. Its cloud-centric approach is unique and combines high-performance, scalable backup, disaster recovery, archival, and analytics. The product can be deployed quickly at sites located around the world while also aligning with regional data storage regulations. Phoenix can also be managed from a central location to provide full control over server backups and data composition.
Google offers a fully-managed enterprise data warehouse for analytics via its BigQuery product. The solution is serverless and enables organizations to analyze any data by creating a logical data warehouse over managed, columnar storage, and data from object storage and spreadsheets. BigQuery captures data in real-time using a streaming ingestion feature, and it’s built atop the Google Cloud Platform. The product also provides users the ability to share insights via datasets, queries, spreadsheets and reports.
Hewlett Packard Enterprise (HPE)
Hewlett Packard Enterprise (HPE) is the enterprise software arm of the computer hardware giant HP. The vendor offers a cloud-based database management solution on Amazon Web Services, Microsoft Azure, or via an individually licensed model. Vertica provides an MPP SQL analytical database with linear scaling native high availability that allows organizations to query data in near real-time.
Hitachi Vantara is a wholly owned subsidiary of Hitachi, Ltd., and offers an expansive portfolio of products for integrating, managing and analyzing data. Hitachi’s portfolio of data management solutions are best suited for modern environments, and can help organizations to quickly improve their key performance metrics, including business continuity, backup windows, operational recovery and disaster recovery. The tool also provides data protection and recovery for complex enterprise architectures.
IBM has data management products for virtually every enterprise use case. Its products can be deployed in any environment, and partnerships with some of the other top names in the marketplace make it an even more intriguing option for organizations with large workloads and expansive data jobs. IBM also offers its Informix database that can integrate SQL, NoSQL/JSON, time series and spatial data.
Informatica’s big data management platform allows organizations to access, integrate, clean, master, govern, and secure big data. The tool features purpose-built connectors to hundreds of data sources, real-time streaming, and mass ingestion. Informatica’s visual developer interface also ensures that the best open-source platforms can be adopted without sacrificing usability. Public cloud support for Big Data Management is available on AWS and Microsoft Azure.
The MapR Converged Data Platform integrates Hadoop, Spark, and Apache Drill with real-time database capabilities, global event streaming, and storage. The product also provides support for dozens of open-source projects and uses industry-standard APIs. Converged Data Platform comes in two editions based on included modules, capabilities, support features and commercial add-ons.
MarkLogic offers an operational and transactional enterprise NoSQL database that is designed to integrate, store, manage, and search for data. Organizations can ingest structured and unstructured data with a flexible data model that adapts to changing data. It also natively stores JSON, XML, text, and geospatial data. MarkLogic’s Universal Index enables users to search across all data, and APIs enable application development and deployment. The database has ACID transactions, scalability and elasticity, and certified security as well.
Microsoft offers an array of data management products, including those for analytics, data governance, and even data virtualization. Its SQL Server solution provides data warehousing for both on-prem and cloud deployments, as well as an in-memory database. Microsoft allows organizations to access, store, and analyze any kind of data and even offers fully-managed Hadoop and Spark. The company is one of the major players in the overall big data marketplace, with top-ranked tools in business intelligence and data integration.
Oracle’s suite of data management capabilities allows users to manage both traditional and new data sets on its cloud platform. The company also offers an autonomous data warehouse cloud with more than 2,000 SaaS applications. The platform runs the gamut of big data functionality, with support for data integration and analytics as well. Its other data management offerings include Oracle Big Data Cloud, Oracle Big Data Cloud Service, Oracle Big Data SQL Cloud Service, and Oracle NoSQL Database.
Pivotal’s flagship data management product is its Greenplum Database. The solution provides integrated analytics on multi-structured data. Greenplum can be deployed on all major public and private cloud platforms, and on-prem in data centers. Its cost-based query optimizer is designed for large data volumes, and scales interactive and batch-mode analytics to large datasets in the petabytes. The platform is based on PostreSQL and Greenplum Database. Based in San Francisco, Pivotal is owned by Dell EMC.
Precisely offers its data integration capabilities via two product families, Precisely Connect and Precisely Ironstream. The company’s flagship application and data integration tools are the Precisely Connect product family. Syncsort allows users to hasten database queries and applications by putting relational databases to best use. The Intelligent Execution feature dynamically selects the most efficient algorithms based on the data structures and system attributes it encounters at run-time.
Riversand is a master data management (MDM) and product information management solution provider. The company’s MDM offering features a multi-domain core designed to provide a complete view of enterprise data. In addition, Riversand includes high-scale computing, a set of streamlined collaboration tools, and data governance functionality. Reporting via the vendor’s data visualization product is included with each license and offers users the ability to run advanced analysis via charts, tables, and dashboards.
SAP offers its data management capabilities on a single platform. SAP HANA allows users to collect and combine all types of data in-real time, as well as enhance data governance, monitoring, and orchestration. Users can also create a unified view of data with smart data integration that enables advanced applications and data management. The platform is flexible and can be deployed on-prem, in the cloud or via hybrid deployments. HANA is an in-memory tool with fast data processing and advanced analytics with OLAP and OLTP processing.
SAS is the largest independent vendor in the data management marketplace. The company’s main product is built atop a data quality platform that allows users to improve, integrate, and govern enterprise data. SAS Data Management can ingest data from legacy systems and Hadoop, and create rules once and reuse them. In addition, users can update data, tweak processes, and analyze results themselves. A built-in business glossary as well as third-party metadata management and lineage visualization capabilities allow for collaboration.
Snowflake offers a cloud data warehouse built atop Amazon Web Services. The solution loads and optimizes data from virtually any source, both structured and unstructured, including JSON, Avro, and XML. Snowflake features broad support for standard SQL, and users can do updates, deletes, analytical functions, transactions, and complex joins as a result. The tool requires zero management and no infrastructure. The columnar database engine uses advanced optimizations to crunch data, process reports, and run analytics.
The Stibo Trailblazer Enterprise Platform (STEP) features data quality capabilities designed to handle data profiling, data matching, and enrichment with external reference data. It also includes a user-friendly interface for implementing business rules, checks, and controls. The graphical interface verifies uncertain de-duplication and matching with external sources.
Talend offers an expansive portfolio of data integration and data management tools. The company’s flagship data management product, Talend Data Management Platform, features graphical tools and wizards, and more than 900 pre-built components and connectors to natively connect databases, flat files, and cloud-based applications. An included data mapper and parsing capabilities allow users to map complex EBCDIC files, XML, JSON, and EDI documents.
Tamr offers a machine learning-based data integration product called Unify. The solution allows organizations to connect to any tabular data and publish it anywhere. Users can map schemas with machine learning suggestions and normalize data formats using Spark and SQL. Tamr’s Master Records feature provides a complete view of all entities via simple yes and no questions as well. Tamr has also begun offering an issue tracker specifically designed for data called Steward (beta).
Teradata offers a broad spectrum of data management solutions that include database management, cloud data warehousing, and data warehouse appliances. The company’s product portfolio is available on its own managed cloud and on Amazon Web Services and Microsoft Azure. Teradata provides organizations the ability to run diverse queries, in-database analytics, and complex workload management.
TimeXtender offers an integrated data management platform called Discovery Hub. The solution allows organizations to merge data from a wide variety of sources and automate data models so users can easily understand data. Discovery Hub can be deployed on-prem, in the cloud or via a hybrid model, and there is no need to manually migrate data schemas or metadata. TimeXtender is a Microsoft Gold Certified Partner and touts more than 3,000 customers around the globe.
The Zaloni Data Management Platform operationalizes data along the entire pipeline, from data source to consumer. ZDP automates repeatable data management tasks and processes and provides central management of all enterprise data sources whether on-prem, cloud, multi-cloud, or hybrid. Zaloni is compatible with all major Hadoop distributions, most data processing engines, and applicable deployment models.