IAP Component: Information Transformation Layer Transforming Raw Data into Valuable Content
Endeca’s Information Transformation Layer (ITL) provides content acquisition and enrichment capabilities to help you efficiently combine information from any source into a single integrated view and add value on top of the raw data. Our ITL integrates with existing ETL packages and comes with an out-of-the-box data integration tool designed for extracting and enhancing both unstructured and structured data. Key ITL Features Include:ConnectivityContent acquisition includes connectivity through ODBC/JDBC, XML, or web services standards, packaged adapters to common repositories such ERP systems, and crawls for information in nearly 400 different file types from file systems, websites or CMS repositories. Join SupportEndeca supports the use of joins which allows information from different sources to be combined by any shared attributes across all records. Join support also enables multiple branches of work to converge as appropriate rather than subjecting every record to every possible processing step. Data Cleansing and EnrichmentData pipelines are fully configurable and support a number of techniques that improve data quality or augment the metadata on records. Capabilities include rules-based data processing, entity extraction, and statistical processing to extract important values which can be applied as metadata to the source record. Taxonomy ManagementWhere available, taxonomies can be included as a data source and used for defining navigation options or defining content enrichment terms. Endeca is also able to build taxonomies directly from attributes in the source data. ExtensibilityA Content Adapter Development Kit is provided for cases where direct access to proprietary systems is required. Custom data cleansing or content enrichment packages can also be inserted into the data pipeline to provide best-of-breed 3rd party functionality in addition to Endeca’s capabilities. Developer ToolsEnd-to-end content acquisition and enrichment is configurable through an intuitive graphical user interface, allowing rapid application development instead of time-consuming integration scripting.
See how Endeca is leading the industry in enterprise search technology and empowering more than 600 leading organizations such as Walmart.com, ESPN, the U.S. Defense Intelligence Agency, and IBM.
|