Aspire is a content ingestion and processing technology specifically designed for unstructured data, such as Office documents, PDFs, web pages, images, voice, and videos. It provides over 40 connectors to a variety of enterprise content sources, including File Shares, SharePoint, Documentum, OneDrive, Box.Com, SalesForce.com, ServiceNow, Confluence, Yammer, and others.
Aspire provides a powerful solution for connectivity, cleansing, enrichment, and publishing of content to enterprise search, unstructured content analytics, and natural language processing applications.
As part of our collection of technology assets for enabling search and unstructured content analytics, Aspire is available standalone or as part of the Applied Intelligence Platform, AIP+.
This fall, our team is excited to announce Aspire’s latest enhancement release – Aspire 4.0 – with notable innovative advancements.
New enhancements in Aspire 4.0
1. Elasticsearch can now be used to hold the Aspire crawl database. This database is used by Aspire for internal processing and job queues.
- Clients who already have Elasticsearch as their search engine can use that same server cluster to hold Aspire’s crawl database. This can substantially reduce infrastructure, hardware, and technology requirements for clients who use Aspire with Elasticsearch.
- MongoDB and HBase are still available in Aspire 4.0 as alternative database providers.
2. New end-points for receiving real-time document updates to configured content sources.
- In addition to real-time updates, these new end-points can also be used to re-process document updates which may have failed in downstream systems, or which may be discovered based on audit checks.
3. A Background Processing & Binary Storage Layer – part of our Intelligent Document X-Ray initiative – allows for slow-running background tasks to be queued and processed when resources become available.
- This is an ideal framework for long-running processes, such as machine learning and Optical Character Recognition (OCR).
- Currently, only file storage is allowed as the storage layer. Other storage layers, for example, Amazon S3, Azure Blobs, Google Cloud Platform storage, will be provided in the near future.
4. Many bug fixes and stability and performance improvements are provided.
<<< Start >>>
Aspire Content Processing 4.0 admin dashboard
<<< End >>>
Other key features
Aspire 4.0 also includes:
- Refactor of failover implementations for higher stability, accuracy, and usability
- Throttling capabilities built into the connector framework to support content crawl throttling to protect legacy systems from being overburdened. Previously, this was done by reducing thread count. The new framework enforces more precise document-per-second throttling controls.
- Enhanced security measures that allow for scripting of business rules to handle special security requirements, for instance, to automatically add groups based on the user’s e-mail address
- Improvements to help simplify installation and license management
- The additions of new and improved connectors to our growing range of over 40 connectors supporting unstructured content acquisition from enterprise repositories
- Substantially improved Confluence connector with full hierarchical security support
- An improved publisher for Google Cloud Search
- A new “test crawl” feature to help test new content sources quickly and easily
- User interface improvements and restyling
Visit our Aspire technical documentation for more details.