- Ignore admin databases from mongo
- Optimizes performance for queries by speeding up time-range queries to MongoDB
- Adds metadata fields to default install
- Fixes tsds_writer queue name loading on startup
- Adds tsds_influx: A Mojolicious based webservice to ingest line protocol / influxdb formated metrics
- Adds support for environment variable based configuration for container based deployments
- Fixed a bug causing start and end parameters to sometimes not be parsed correctly.
- Fixed an issue with the test harness preventing a custom list of tests from being run.
- Update MongoDB perl driver to 'v2.2.2'
- Removes deprecated "update" and "insert" MongoDB calls
- Moves venv library import to shebang to fix Authen bug
- Fixes verification memory leak caused by MongoDB perl driver list indexes call
- Fixes POST request memory leak caused by uploadinfo call inside GRNOC::WebService::Method
- Adds in local changes made to code as 'hot fixes' that were never added to the Git repo.
- Adds in support for OL8 packaging
- Allows writers to move failed writers to separate queue to keep pipeline clear and allow for examination later.
- Adds default 2-day expiry for measurement decomissioning
- Writer processes will now automatically attempt to reconnect to Redis in the event of a failure instead of hanging.
- Fixed issue where mecached caching of metadata was inconsistent successful based on the order of the messages. This should improve and increase predictability of overall system performance.
- Including a new script for changing the values of required metadata fields. This will allow for things like renaming a host while maintaining continuity in the data.
- Fixed issue introduced in 1.6.7 where subqueries to constrained locations would fail due to their use of a temporary table.
- Fixed issue with reconnecting to RMQ in the Push webservice API
- Allow for updating metadata through the standard tsds_writer messages. This change simply requires adding ".metadata" to the type being sent. Non required metadata fields in the message will be processed accordingly. This is a separate message from the previously standard value messages.
- Allow for scoping of certain data types and metadata patterns for users in the push API. This allows additional users to push data without exposing the risk that they send data for elements they are not allowed to.
- Improved performance when changing interval of datasets.
- Removed the unused
event
portion of code in an effort to consolidate and simplify code.
- Fixed issue where it was possible to bypass a constraint. Previously if a particular view did not have the measurement type defined at all, it was incorrectly considered unconstrained instead of globally denied.
- Tweaked the caching and message fetching worker processes employ to help reduce memory usage, particularly in cases where things are backed up
- Adjusted sphinx configuration to only search for active measurements. This feature will ultimately be removed in a future release, but this helps existing deployments that have gathered lots of measurements over time.
- Support '=' character in where clause's, such as
where x like "="
- Added 'day' as a support target for the 'align' option in aggregate
- Added support for nested unwind operations, such as extracting an array of elements inside of an arrayed element.
- Parent metadata elements should no longer be indexed. These fields are never directly accessed in TSDS and are taking up space / index slots in Mongo for no reason.
- Fixed case where != applied to a numeric datafield would generate an incorrect query.
- Fixed case where when aggregating data, a bucket of all null datapoints would return 0 in the case of sum'ing. Now returns null.
- Fixed issue with get_meta_field_values not respecting the offset parameter
- Added "__timestamp" as an optional by clause element. Including this in the by clause will make unique merging also consider the start/end times when determining uniqueness. For example, grouping by a circuit's first interface by default will pick one interface and use it for the entire query timeframe. Grouping by the circuit's first interface and __timestamp will make it use the first interface for as long as that interface existed, and then the next interface, etc for the duration of the query. Careful use of order by is recommended when doing this to ensure your results are sane.
- Fixed an issue introduced in the last release that would cause aggregates of aggregates to emit a lot of warnings in cases where some value types weren't defined.
-
Added "align" clause to aggregate() calls. This feature does more intelligent time alignment than straight epoch seconds math, such as weeks or months that may start/end on inconsistent extents. It also does an implicit level of aggregation to that, so aligning to week will generate a weekly average or max, depending on what aggregate it's attached to. This allows queries to specify a different level of aggregation to fetch data vs how to combine it without resorting to subqueries in all cases.
e.g. aggregate(values.input, timeframe, average) align week
-
Added ability to use "count" as an aggregator function in aggregate() calls. This allows queries to return the number of datapoints in a bucket independent of their values.
-
Refactored some of the unit tests to include longer duration interfaces, and to run the Writer process more efficiently.
-
Added tsds_setup script to make first time installation and setup of pieces such as Mongo more streamlined and automated.
- Fixed issue where histograms returned from inner queries were not processed correctly in outer queries making using of the aggregate() function.
-
ISSUE=8708 Substantially reduce memory usage in some cases by pre-processing documents as they stream from mongo instead of post accumulation.
-
ISSUE=8332 Changed "dirty" flag in documents from a timestamp to a boolean to increase index efficiency. This requires working with tsds-aggregate >= 1.2.0, as a similar change was made there.
-
ISSUE=9001 Changed calls to perl MongoDB's
database_names
to a more direct call that can utilize the nameOnly parameter, avoiding excessive MongoDB locking during a lot of operations. -
ISSUE=9002 Improved speed when combining histograms in a number of common cases by avoiding recalculating indexes when the histograms shared the same dimensions.
-
Optimized to remove an unneeded query starting in Mongo 3.4 by utilizing $facet operator. This allows for count of total hints + pagination to work in a single query instead of two.
-
Optimized to remove testing for
$exists
in fields that will always be present. -
Reduced overall amount of database connections used by lazy connecting in several places, collapsing certain cases to a single connection, and using a singleton approach for cases where multiple libraries may be making the same connection.
- Made the start/end consideration in all queries more efficient by removing unneeded cases.
-
ISSUE=8029 Refactor the way the data and metadata documents are merged during queries to be a lot more efficient, particularly in high metadata cardinality cases.
-
ISSUE=6517 Added support for another argument to the extrapolate function to request a series of datapoints back. This uses the same logic but instead of returning a single number returns an array of points using the calculated slope/intercept so that it can be easily plotted.
-
ISSUE=5336 Changed the way expiring measurement entries is internally calculated to be much more efficient. This now makes use of index only queries to find the needed data instead of performing a lot of fetches.
-
Allow for queries on the "identifier" field. This is useful as a shorthand because "identifier" is used to unique identify a particular measurement series.
-
Improved speed at which we process metadata documents that do not have any associated data documents. This also improves the speed of any query where no data fields are selected.
-
ISSUE=5973 Fixed issue where queries with subqueries and math operations that also generated very large numbers would get handled incorrectly. This was a subtle function of how perl represents very large numbers in a string context.
-
ISSUE=7470 Fixed issue where updating metadata with all of the same values but a different end time could result in duplicate in service docs.
-
Fixed issue where trying to perform math operations using numeric metadata fields as one of the operands would crash, such as values.input / max_bandwidth.
-
ISSUE=3222 Updates to using perl driver 1.0.0+ and Mongo 3.4. This release is backwards incompatible with hosts running earlier versions of the driver, since the driver is itself backwards incompatible.
-
Added systemd unit files for EL7.
-
Allow updates of metadata where only the end time is set, ie decomission a measurement.
-
Update Image service to use https to better interoperate with browsers implementing hsts.
- ISSUE=5417 Added "sum" as a secondary function to aggregate() to support summing horizontally across an aggregate query with multiple elements in a by clause
- ISSUE=4088 Properly use histogram when calculating percentile() on aggregate fields
- ISSUE=5402 Fixed off by one timestamp reporting in aggregate() function
- ISSUE=5415 Forced extent alignment in timestamps reported in aggregate() function to requested bucket size
- ISSUE=5057 Added aggregate chart to Portal section. Also added additional metadata to the Portal section display, such as interface descriptions.
- ISSUE=2633 Fixed missing inclusion of tsds-decom.pl script
- ISSUE=5057 Fixed sorting issue with Portal section not being respected.
-
ISSUE=2194 Added several new support symbols to the query BNF, including +, !, #, $, %, and ?
-
ISSUE=2221 Revamped the way that measurement entries get decommed after not receiving new data for a configurable period of time. Fixed several issues related to incorrectly detecting data during historical migrations, fixed a few race conditions.
-
ISSUE=2222 A series of fixes and enhancements to better support parallelization, including better locking around race conditions in the metadata management API.
-
ISSUE=2166 Added "tsds_fix_measurements.pl" script to fix any outstanding problems that should be addressed going forward by changes in 2221 and 2222.
Fixed issue where "get_distinct_meta_fields" function was not respecting constraints.
- ISSUE=458 Make aggregate queries expand the start and end times of the actual query to ensure that they align on a proper time boundary. The actual start and end times are returned as fields in the response.
- ISSUE=471 Improved human readability of query parse errors.
- ISSUE=394 Rewrite of tsds_meta from python to perl and updated to work with Mongo 3.0 line of code, removal of python dependencies from package.
- ISSUE=464 Fixed issue where if the metadata for a particular measurement ended AFTER the query time, data from later than the query end could still be returned.
- ISSUE=755 Fixed issue where writer would start to thrash when receiving messages with an invalid value for one of the "values" fields. Messages will properly be detected and skipped if so.
- ISSUE=708 Fixed edge case where writer would complain about aggregate messages with a histogram width with a floating point value. This was only seen in cases where data variances were very small, such as in optical readings.
- ISSUE=12536 query language now supports relative (from "now") and epoch timestamps in between clause
- ISSUE=12641 MongoDB document writes are now done in bulk operations to improve performance with less round trips
- ISSUE=12401 improved performance for search suggestions using new get_distinct_meta_field_values() webservice method
- ISSUE=12463 set the "updated_start" and "updated_end" timestamps in data documents for new aggregation framework
- ISSUE=12408 support data aggregation within the writer worker processes when receiving aggregate message for new aggregation framework
- ISSUE=12464 aggregate data from other aggregated data when possible (1 day aggregates will use 1 hour aggregate data)
- ISSUE=12566 properly set eval position for the default aggregation and expiration rules upon new installation
- ISSUE=12412 require data interval to be a positive, non-zero integer
- ISSUE=12499 fixed default file path location for sphinx search index template files
- ISSUE=12500 properly handle unicode characters when indexing sphinx search data
- ISSUE=12142 now open sourced on GitHub: https://github.com/GlobalNOC/tsds-services
- ISSUE=11649 added ability to set the classifier flag on non hierarchical meta fields
- ISSUE=12085 added ability to not use SI notation on a per measurement type basis
- ISSUE=12129 added support for an optional
having
clause in queries to post-filter results - ISSUE=12129 added ability to perform math on get fields, such as
average(values.input) - average(values.output)
- ISSUE=12129 support arbitrary nested aggregation operations instead of just two layers
- ISSUE=12131 support ability to search/filter by value
- ISSUE=12133 added ability for the user to define which fields the query is grouped by in the TSDS browse section's advanced search
- ISSUE=12135 added ability to only take the first unique pairing when doing a group by, allowing queries to specify uniqueness without doubling up data such as in the case of circuits with two endpoints
- ISSUE=12149 addressed issues in INSTALL doc and converted it to use markdown syntax for GitHub
- ISSUE=12232 added script to fix overlapping measurement documents
- ISSUE=12241 added index on "identifier" in measurement collection to improve writer lookups on cache misses
- ISSUE=12355 added
tsds_find_unsharded.pl
script to help with identifying bootstrapping issues on some collections that have been deployed - ISSUE=12362 moved unknown field warning messages down to debug level
- ISSUE=12315 added --noconfirm option to tsds_upgrade.pl to allow for unattended upgrades if desired. The default is still interactive.
- ISSUE=12027 fixed issue where event query for one day didn't include events whose start and end are not on the same date in the result
- ISSUE=12111 fixed bug where writer would fail when it encountered a measurement type without a valid metadata collection & document
- ISSUE=12145 removed problematic code that tried to optimize messages with > 100 data points
- ISSUE=12304 added missing indexes on start and end in measurements collection to upgrade script
- ISSUE=12319 moved temporary workspace over to a single, shared, sharded collection to prevent a MongoDB deadlock bug
- ISSUE=12343 fixed problem where aggregate processes were stacking on top of one another ignoring existing lock files
- ISSUE=12363 fixed issued where aggregate data collections weren't getting sharded and added unit tests to verify necessary collections are
- ISSUE=12126 Fixed issue where the search API was not respecting the order of the ordinal fields set on value fields
- ISSUE=11369 add missing last_updated index to measurements collection
- ISSUE=10785 added new metadata management webservice API that will be used in a future frontend release
- ISSUE=12026 allow configuration for sphinx search instances that reside on remote servers
- ISSUE=11900 several INSTALL documentation updates and improvements
- ISSUE=11495 added support for different constrained views of the data configured at separate URLs
- ISSUE=11951 changed default parallel processes used when aggregating data from 32 to 4
- ISSUE=11369 added new global search API using sphinx for backend indexing
- ISSUE=11801 global search query support from browse section
- ISSUE=11556 data aggregation and expiration rules are now maintained separately
- ISSUE=11793 added support for angle, square, and curly brackets in the query language
- ISSUE=12042 fixed issue where search section was only sorting the current page
- ISSUE=12060 provide index hints for MongoDB queries to try and address performance problems
- ISSUE=12019 only allow lowercase a-z and underscore characters for measurement type and meta field names
- ISSUE=11936 writer will now re-activate decom'd measurements
- ISSUE=11896 greatly improved performance by not processing existing data points of a document being updated
- ISSUE=11881 fix bug where we were holding onto all the memory of every document being updated in a single message update
- ISSUE=11814 fix bug to be able to clear ordinal field on value types
- ISSUE=11802 continuously attempt to reconnect to rabbit in case of connection failure
- ISSUE=12091 fixed issue where queries with a limit and a sort may not be returned in the proper order
- ISSUE=12102 make sure values stored in mongo don't get stringified
- ISSUE=11681 Added authentication support. This change is not backwards compatible with existing clusters running in no-auth mode and requires upgrading them. The upgrade script must only be run after authentication/authorization has been enabled and a root user has been created.
- ISSUE=11495 added initial constrained/custom view support
- ISSUE=11368 improved bootstrap/install unit tests
- ISSUE=9627 starting working on global search support (not yet available)
- ISSUE=11635 fall through to use aggregate data if no high res docs exist
- ISSUE=11633 fix document expiration algorithm to match what aggregations do (based upon policy eval position)
- ISSUE=11281 chkconfig support fix for init script
- ISSUE=11429 disallow dashes in measurement type names (MongoDB doesn't like them)
- ISSUE=11390 allow null end timestamps for events
- ISSUE=11281 Fixed typo in tsds_writer init script where chkconfig would not recognize it
- ISSUE=11290 Improve performance by specifying index hints and not fetching the same high res docs multiple times when aggregating
- ISSUE=10949 always store 1000 data points inside data documents based upon provided interval instead of 2 hour long documents
- ISSUE=11070 increase aggregate doc data points from 100 to 1000 and store them as three dimensional arrays
- ISSUE=11225 added support for aggregation histogram configuration in admin services
- ISSUE=11080 dont disable journaling in provided config file
- ISSUE=10805 only retrieve aggregate histogram data when its needed by the query to improve performance
- ISSUE=11187 use measurement min & max when calculating histogram buckets, or fall back to min_width when doing dynamic calculation if needed
- ISSUE=10798 added ability for tsds_firehose.pl test script to do best-case, worst-case, and sin-cos import options
- ISSUE=11007 do full doc merge and replace instead of supplying mass update clauses to mongo when doing large amounts of updates
- ISSUE=10462 added sorting and pagination for portal results
- ISSUE=10697 automatically set ordinal values for the required metadata fields when adding new measurement types
- ISSUE=10708 added ability to set the maximum number of documents and the maximum number of events a query can return
- ISSUE=10944 prevent writer from crashing when handling invalid/malformed data
- ISSUE=10925 fix issue where the writer wasn't requeueing messages to try again later when MongoDB was unavailable
- ISSUE=11193 fixed issue where the tsds_install bootstrapping script was not properly adding values to measurement types from their .json files
- ISSUE=10957 uniquely identify events by start time + identifier instead of event text
- ISSUE=10841 fix pymongo's ConnectionFailure namespace and fixing hard coded rabbitmq host and port in tsds_meta.py script
- ISSUE=10702 aggregate measurements only once per interval using the last eval_position aggregation rule match
- ISSUE=10700 fix bug where eval position for aggregate rules was being calculated incorrectly when changed
- ISSUE=10703 Moved TSDS SNAPP Migrator into its own package out of grnoc-tsds-services
- ISSUE=10701 fix tsds_expire.pl script
- ISSUE=10691 Including missing aggregation.cgi and admin.cgi files
- ISSUE=7992 support migrating data from old SNAPP
- ISSUE=9992 added temperature bootstrapping data to install
- ISSUE=10141 added admin section webservice support
- ISSUE=10254 added server side graph support
- ISSUE=10353 support data interval changes
- ISSUE=10356 improved init script for perl tsds_writer
- ISSUE=10520 added backend changes for data retention methods
- ISSUE=10528 added event support in perl tsds_writer
- ISSUE=9951 properly handle case when no measurements exist when aggregating data
- ISSUE=10429 improve parallelization for data aggregation, lock file support, improved performance, and fixed file permissions
- ISSUE=10430 fix aggregate data fall through support issues and improved performance
- ISSUE=10495 Added "alternate_intf" to store the other form of an interface's name, such as "Te0/0/0" versus "TenGigabitEthernet0/0/0"
- ISSUE=10631 Fixed issue with fetching values for metadata fields that have a _ in them when using logic parameters, such as "alternate_intf_like"
- ISSUE=10525 fix problem where _execute_mongo() got confused with any warning messages from the mongo CLI
- ISSUE=10247 Added generalized support for "events" in TSDS in both the writer processes and in the query parsing engine.
- ISSUE=10355 Added aggregation and expiration policy definition webservice support.
- ISSUE=10254 Initial version of server side get_chart method to allow rendering of charts on the server instead of the client.
- ISSUE=10022 Fix bootstrap issues with new versions of MongoDB 3.0
- ISSUE=10141 Changed collection existence check to use listCollections for performance reasons.
- ISSUE=10201 Fixed writer processers to attempt to auto reconnect to MongoDB instead of dieing
- ISSUE=10429 Fix issue where aggregator would attempt to issue a blank bulk query if nothing to do for a document and generate errors.
- Initial release of TSDS Services. This package provides a Domain Specific Language and REST interface to data stored in a MongoDB timeseries database. It allows for pushing data into the system as well as retrieval of data out. It also includes scripts to help manage aggregation and expiration of the data as well as a script to read messages from RabbitMQ for data insertion.