ContraxSuite 1.3.0 Release Update

Client demand for document and contract analytics tools like ContraxSuite continues to grow. The LIBOR transition is a big part of this shift, one that we explore more deeply in our white paper.

More generally, though, organizations are looking for a powerful platform that can flexibly meet needs as they shift from quarter to quarter. We have continued to expand the scale, capability, and features of ContraxSuite in order to meet this demand. That is why we are very proud to announce the latest release of ContraxSuite, version 1.3.0. ContraxSuite Release 1.3.0 is the twenty-fifth open-source release and became generally available on November 4th, 2019. Read on for a detailed description of the latest updates to ContraxSuite.

 

End User Interface Release Notes

Release 1.3.0 focuses on the following broad issues:

  • Wide-ranging UI/UX improvements, including many new features in the Data Grid and Annotator screens, providing users greater control over the data they see, and what they can do with it.
  • New functionality for formatting data from other platforms and importing it into ContraxSuite.
  • Expanded capacity for document uploading and parsing.
  • Improvements to memory allocation for bulk tasks.
  • Fixes, improvements, and expanded functionality for Admin tools.

Detailed Changelog for End User Interface

New Features in Release 1.3.0
  • Annotator: New collapsible left side pane that can be toggled to display the following:
    • A document’s internal Table of Contents, in a collapsible “Section Navigation” sidebar.
    • All definitions found within a document, in a collapsible “Definitions List” sidebar. Users can click definitions in the list to highlight them in the main viewing pane.
    • The option to toggle annotation highlights on or off.
  • A “New Custom Links” tab is now available from the Main Menu. This feature allows users to add important external links directly to their ContraxSuite instance.
  • A new Dynamic Content Loader, which displays content on the Data Grid automatically, as needed:
    • Documents appear in the Data Grid as the user scrolls up or down the list.
    • Data appears in the Data Grid as the user scrolls from left to right.
    • Greater clarity of highlights within the Annotator screen and the right-side pane when a field is selected.
    • Faster load times when Field Values are saved, deleted, or selected.
    • Improved navigation to terms found via Search bar.
  • Annotator: Projects with large numbers of Document Fields can now have their Document Fields displayed in numbered and paginated “Categories” on the right side pane, for easier targeted search and smoother workflow.
  • Other new features in Release 1.3.0:
    • Project admins now have a searchable Document Field Values Grid.
    • List of Projects on the Main Menu is now searchable.
    • Users can now easily download any document from the Annotator screen by clicking the “Download” button in the top-right corner. Any documents downloaded in this way will retain their original file format.
    • Project admins can now clone Document Types and their associated Data Fields.
    • Data from other platforms can now be formatted and imported into ContraxSuite.
    • A new “Stats” page on the left-side navigation pane. The Stats page displays statistics and useful data on all projects within the ContraxSuite instance.
    • When a user opens a Batch Analysis document in the Annotator, the document’s color-coded cluster will be displayed next to the document’s name along the top of the main viewing pane.
    • Documents can now be deleted from projects without affecting de-duplication of newly uploaded documents.
    • Large sets of documents in compressed files (.zip, .tar, .tar.gz, .tar.xz) up to 50 MB can be uploaded directly into ContraxSuite.
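The archive upload rule in the last item above can be sketched as a client-side pre-check. This is only an illustration, not ContraxSuite's actual implementation: the function names are hypothetical, and reading "50 MB" as 50 × 1024 × 1024 bytes is an assumption.

```python
import io
import tarfile
import zipfile
from typing import List

# Assumption: "50 MB" is read as 50 * 1024 * 1024 bytes.
MAX_ARCHIVE_BYTES = 50 * 1024 * 1024
# Archive types listed in the release notes.
ALLOWED_SUFFIXES = (".zip", ".tar", ".tar.gz", ".tar.xz")


def is_uploadable_archive(filename: str, size_bytes: int) -> bool:
    """Hypothetical pre-check: supported suffix and size within the limit."""
    has_allowed_suffix = filename.lower().endswith(ALLOWED_SUFFIXES)
    return has_allowed_suffix and 0 < size_bytes <= MAX_ARCHIVE_BYTES


def list_archive_members(filename: str, data: bytes) -> List[str]:
    """List the regular files inside a supported archive held in memory."""
    if filename.lower().endswith(".zip"):
        with zipfile.ZipFile(io.BytesIO(data)) as zf:
            return [info.filename for info in zf.infolist() if not info.is_dir()]
    # Mode "r:*" lets tarfile detect gzip, xz, or no compression on its own.
    with tarfile.open(fileobj=io.BytesIO(data), mode="r:*") as tf:
        return [member.name for member in tf.getmembers() if member.isfile()]
```

A caller might run `is_uploadable_archive` before sending anything to the server, then use `list_archive_members` to show the user which documents the archive contains.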
Improved Features in Release 1.3.0
  • Word2Vec and Doc2Vec models now use continuous bag-of-words (CBOW) to enhance data models.
  • Faster bulk-updating of statuses and assignees for large numbers of documents.
  • Durations parser for both EN and DE languages has been enhanced.
  • Clicking Data Science -> Data Interface now opens up a new browser window.
  • A project’s Document Type (e.g., “Lease” or “Employment Agreement”) is now displayed under the project name at the top left of the main viewing pane.
  • Improved memory allocation for tasks such as bulk search, and searching within the Annotator’s main viewing pane.
  • Warning messages and error messages have been revised for clarity.
  • Admin Interface: Improved navigation from the Data Grid view to a specific Document Type or Document Field page.
  • Annotator: Improved the display of conditional fields and logic on document creation screens.
  • Annotator: Users can now search for keywords, or search via regex.
  • Annotator: “Batch” view now named “Unclassified”.
  • Annotator: Ctrl+F now opens the Annotator’s search bar, rather than the browser’s. The Esc key will close the search bar.
  • Data Grid: Data Grid export is now in .xlsx format by default (previously, .csv was the default).
  • Data Grid: Column width expanders are now easier to see and use.
  • Data Grid: “Column Visibility” pop-up now has a “Select All” checkbox.
  • Document Upload: Users can now choose to cancel an upload job before it’s completed.
  • Document Upload: If a “Load Documents” task fails, all documents that were not successfully uploaded will be erased from the database.
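The difference between the Annotator's keyword and regex search modes, noted in the list above, can be illustrated with a short sketch. The function name and the case-insensitive matching are assumptions made for illustration; the release notes do not describe how the Annotator implements search.

```python
import re
from typing import List, Tuple


def find_matches(text: str, query: str, use_regex: bool = False) -> List[Tuple[int, int]]:
    """Return (start, end) character offsets of each match of query in text.

    In keyword mode the query is escaped, so characters such as "." or "("
    match literally. In regex mode the query is used as a pattern as-is.
    """
    if not query:
        return []
    pattern = query if use_regex else re.escape(query)
    # Assumption: search ignores case.
    return [m.span() for m in re.finditer(pattern, text, flags=re.IGNORECASE)]
```

For example, `find_matches(text, r"\d+ months", use_regex=True)` would locate every duration expressed in months, while the keyword query `a.b` would match only the literal string "a.b" and not "axb".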
Bug Fixes
  • Tesseract and Apache Tika run more smoothly on high-volume document sets.
  • Deleting documents created bugs with re-indexing and with re-uploading similar documents into the same project.
  • Some documents failed to upload, and text failed to be extracted.
  • Esc button functionality had not been implemented for all windows and data fields.
  • Admin Interface: An error occurred when trying to create new Document Field categories.
  • Admin Interface: Left side menu didn’t display in Chrome.
  • Admin Interface: Trying to save a new Document Field configuration would sometimes fail.
  • Admin Interface: Adding a Field in “Depends on fields” would fail.
  • Admin Interface: Added improved search functionality to save and share search query parameters in the URL address.
  • Advanced UI: Clustering project documents would cause a task failure.
  • Annotator: Selections within multi-choice fields were not correctly saving.
  • Annotator: Pressing the arrow buttons on the top right of the main viewing pane would not cycle through all Data Field values if the right side pane wasn’t expanded.
  • Annotator: If a user attempted to delete field values associated with an annotation, the system would warn the user and ask for confirmation, yet the data would be cleared from the database before the user confirmed.
  • Annotator: When a text unit had too many labels, they would reach outside of the main viewing pane.
  • Annotator: Document Fields would not display 0 in the right-side pane, even if 0 was the correct value for that field.
  • Annotator: For Document Fields with the option to add a new Field Value, the + button did not appear in the right side Document Fields pane, and added “Other” values were not properly saving.
  • Annotator: Annotator lagged when creating new “Other” values for annotations.
  • Annotator: Error resulted when user tried to delete a field value after deselecting its associated text.
  • Annotator: Writing a single note would sometimes create two notes.
  • Annotator: Selecting a Currency Type in a Currency field’s drop-down menu was not saving properly.
  • Annotator: Entering a value into a Ratio field was not saving properly.
  • Annotator: In Microsoft Edge and IE, highlighted fields were not displaying properly.
  • Annotator: Percent fields were displaying incorrect values.
  • Annotator: In IE it was possible to enter a value into a Document Field even without the required annotation.
  • Annotator: It was possible to save the default phrases, such as “Enter other value”, as actual values in Choice fields.
  • Annotator: Ctrl+C and Ctrl+V were not working properly.
  • Batch Analysis: Some data fields were not showing up in the Batch View Data Grid.
  • Batch Analysis: Moving from Cluster view to Data Grid view would show no documents in the Data Grid.
  • Batch Analysis: Data Grid would sometimes not show all documents in a project.
  • Data Grid: Documents were duplicated on the grid if sorting was applied.
  • Data Grid: Bulk assigning documents to a user maxed out at 200 documents.
  • Data Grid: Selecting one document in Data Grid would sometimes display total number of documents in project next to “items selected”.
  • Data Grid: “Property Address” Field would show code that should have been hidden.
  • Data Grid: Some filtering and sorting controls were not displaying properly; for example, the ascending and descending sort direction arrows did not appear in Microsoft Edge.
  • Data Grid: Selections made in “Column Visibility” pop-up would not correctly display after clicking “Save” button.
  • Data Grid: Checkbox icons were not aligned properly with the columns in which they appeared.
  • Data Grid: Right-clicking to “Open New Tab” or “Open New Window” was not working correctly.
  • Document Upload: A 502 error now only temporarily interrupts document upload. The upload restarts automatically once the error resolves.
  • Document Upload: Users can now resume a stopped document upload without re-uploading documents that are already in the project.
  • Document Upload: The percentage of documents uploaded was displaying incorrect numbers.
  • Document Upload: Adding documents to a project via the “Add Contracts” button would create an upload error.
  • Main Menu: The list of “Recent Projects” was not updating correctly.
  • Main Menu: Project list wasn’t expanding properly in IE.
  • Project Settings: Username would appear in “Add User” list, even after that same user had already been added to the project.
  • Numerous other bug fixes in Clustering pane, Data Grid, Annotator, and Data Field entry.

 

Backend Release Notes

Release 1.3.0 focuses on the following broad issues:

  • Improvements related to stability of interactions with the database, and performance of document loading.
  • Updates aimed at optimizing storage of field values and annotations.
Detailed Backend Changelog
  • Massive refactoring of Document Field Values:
    • DocumentFieldValue model replaced with separate FieldValue, FieldAnnotation and FieldAnnotationFalseMatch models, which allow querying of field values and annotations as-is, without the need to re-store values of complex fields from annotations.
    • Django-simple-history is now supported for field values.
    • Various improvements of the system related to field value storage and processing.
  • Database switched from PostgreSQL 9.6 to PostgreSQL 11. Postgres is configured dynamically based on the available CPU and memory resources of the host machine.
  • JupyterLab added to ContraxSuite.
  • Improved Celery task logging: colored task logs, more human-readable and less verbose, shortened stack traces, with stack traces collapsible in task details UI.
  • Old ClusterProjectDocuments task is purged if a new one is started, and API has been updated to deliver the right clustering status in cases where the task is aborted.
  • Improved Clustering algorithm to incorporate all term counts.
  • Added new API endpoint to get project progress.
  • Added custom logo URL to app variables.
  • Created UI to review the pg_stat_activity and pg_stat_statements PostgreSQL views to debug active or long-running database queries.
  • Created UI to review database size and database tables, and the structure and size of indexes.
  • Large sets of documents in compressed files (.zip, .tar, .tar.gz, .tar.xz) up to 50 MB can be uploaded directly into ContraxSuite.
  • Added API supporting full CRUD operations on custom menu groups and links.
  • Improved logic related to deleting documents in case of a failed LoadDocuments task.
  • Improved logic related to uploading duplicate documents to a project, including when existing document is in “pending delete” status.
  • Added ability to cancel an “upload documents” job.
  • Fixed ability to use definitions incorporated into Field Detector.
  • Added API to accept lists of filenames and sizes before uploading them to check for duplicates.
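The duplicate pre-check described in the last item above can be sketched as follows. This is a minimal illustration that assumes a file counts as a duplicate when both its name and its size match a document already in the project. The release notes do not state the actual matching rule or the API's request format, so the function and type names here are hypothetical.

```python
from typing import Dict, Iterable, List, Set, Tuple

# Hypothetical key for a file: (filename, size in bytes).
FileKey = Tuple[str, int]


def split_duplicates(
    candidates: Iterable[FileKey], existing: Set[FileKey]
) -> Dict[str, List[FileKey]]:
    """Partition candidate uploads into new files and likely duplicates.

    Assumption: two files are duplicates when name and size both match.
    Repeats within the candidate list itself are also flagged.
    """
    seen = set(existing)
    result: Dict[str, List[FileKey]] = {"new": [], "duplicate": []}
    for key in candidates:
        if key in seen:
            result["duplicate"].append(key)
        else:
            result["new"].append(key)
            seen.add(key)
    return result
```

Checking names and sizes first means only the files reported under `"new"` need to be transferred, which also fits the release's ability to resume a stopped upload without re-sending documents that are already in the project.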

LexNLP Changelog

  • Made massive improvements to EN definitions and companies parsers.
  • Updated EN dates parser to catch more date formats.
  • Made company parsing strongly typed.

The full release notes and changelog for Version 1.3.0, and for all prior versions of ContraxSuite, are available at the ContraxSuite GitHub page.

Stay up to date on ContraxSuite’s expanding capabilities by reading our series of LIBOR-focused blog posts, or check out our LIBOR white paper.

To get started with ContraxSuite, visit our website, or drop us a line at contact@lexpredict.com.
