CHANGES
=======

0.6.1
-----

* readme: fix rst formatting

0.6.0
-----

* utils: fix keep\_file to return True when a file is not excluded
* setup: removes server left-over
* readme: document the new python module usage
* Remove server components
* download-logs: remove include-path option
* utils: prevent division by zero on Windows
* process: use a functional argument for process\_line
* process: use a functional argument for keep\_file
* process: use a functional argument for the filename2modelname
* tests: remove python version pinning and rename to unit
* process: simplify tokenizer by removing ansible std\_lines unpack
* process: simplify files\_iterator function by removing non list input
* process: simplify process function by removing non list input
* process: simplify train function by removing non list input
* process: add save\_file and load\_file to simplify joblib usage
* cmd: annotate cli functions
* process: annotate the test function
* utils: add type annotation for the files\_iterator
* data: add Build data type
* data: add initial data type using TypedDict for process Result
* Ignore reformatting change rev for git blame
* Reformat source code using black
* Enable mypy linters
* download: add support for absolute log path in indexes
* Switching Fedora runc nodeset to pod
* Update filename2modelname method to work with kubernetes logs

0.5.2
-----

* Remove webui build by default from the spec

0.5.1
-----

* Fix download-logs usage

0.5.0
-----

* Add command line filters configuration file
* Add benchmark script
* Add only\_model option to the MQTT service
* Remove SCL macros from the spec file
* Fix exclude-lines regexp joblib dump
* Report result to zuul using an artifact instead of the log\_url
* Improve tokenizer for OpenStack logs
* Add exclude lists to the MQTT service
* Add mqtt based worker
* Fix IPv6 tokenization when the last field contains letter
* Add mqtt client service structure
* Refactor worker process in the main logreduce module
* Try to detect binary file
* Ignore bzip2 extension
* Ignore pcap file
* Add exclude-lines argument
* Move the log link to the additional info in the html output
* Fix concurrent logreduce-tests job
* Ignore atop.bin file and tarball
* Make server installation optional
* Fix scikit-learn 0.21.3 support
* Ignore files from supportconfig plugins
* Ignore files from /proc/net
* Ignore files with .raw extension
* Correctly show the gaps between hunks of context
* Show the gaps between hunks of context like grep does
* Add some short options for convenience
* Show default values in help text
* Reduce deprecation warnings by declaring raw strings
* Add server configuration instructions

0.4.0
-----

* Bump model version and fix typo
* Add HashingAnnoy model
* Add hashing\_nn benchmark in doc string
* Add HashingApproximateNeighbors model
* Implement iterator interface for file-like objects
* Refactor TokenizerTests
* Provide a bit more info about timings of the training
* Remove support for bag-of-words\_lshf
* Don't store duplicate data in model
* Fix heat\_uuid regexp formatting
* Relax digits\_re again a bit
* Vectorizer optimisation: don't do word analysing
* debug\_lineprocess: Handle more than one input file
* debug\_lineprocess: Format output slightly nicer and remove duplicates
* Tighten heat\_uuid regexp
* Tighten length-based regexp matches properly
* debug\_lineprocess add some simple word / token statistics
* Blacklist .xml extension
* Use for loop instead of handcrafted while construct
* tests: use free tcp port for gearman server
* Add --model-type argument to top-level command
* tokenizer: remove sshd warnings
* Make debugging scripts callable again
* Reduce code duplication a bit
* Micro-optimize the tokenization
* ci: enable gate jobs
* Make systemd service file SCL independent
* Transition webui related files to the log-classify name
* JSON encode Journal and AraReport object type
* Match uuid\_re before heat\_re
* Use SqlAlchemy intrinsics for ordering
* Fix overly greedy date tokenization
* Fix tokenization error on removing SSH fingerprints
* DRY: Remove implementation override that also exists in the base class
* Fix assertEquals() deprecation warning
* Use generator for reading files
* tokenizer regexp speedups
* cmd: add --json argument to report options
* Spelling typos
* logreduce: Fix inconsistency for model\_file in model-run
* logreduce.spec: Fixes
* README: Add openSUSE instructions
* Add py36/py37 to the env list as well
* Run pep8 against pip installed flake8

0.3.0
-----

* Add support for ara database classification
* Add webui sub-package to the spec file
* tokenizer: handle hexa numbers
* react: fix typo and add build time
* Add React web interface
* Add worker service to process user report
* Add anomaly update method
* Add model skeleton and report import method
* Add api skeleton

0.2.0
-----

* Use ara[-\_]\*.\*/ in the default ignore paths list
* Fix download asyncio loop and logger names
* Record test command used to train models
* Add a uuid to model object
* Remove chunk grouping in the process function
* Rewrite html output using patternfly
* Collect ZuulBuild in anomaly report
* Add --cacheonly argument to skip file download
* Add ara-.\* to the default ignore list
* Rewrite ZuulBuilds download module to discover base log\_url
* common: small fixes for automated process

0.1.3
-----

* Update zuul-jobs log-classify role
* Fix ARA report directory link to ara-report
* Automatically set the zuul-web url for logs.rdoproject.org
* Add tripleo-ci job tokenization
* Update README todo lists
* Fix typo in ARA Records Ansible
* Start with a capitalized pypi summary
* Move flake8 from test-requirements to the linter env

0.1.2
-----

* Fix iptable ignores typo
* Fix journald SYSLOG\_FACILITY
* Support using local path with job-run command
* Check for include-path change when reusing a model
* Add --test-prefix to enable execution on test instance
* Fix training speed debug output
* Improve files ignore list
* Skip empty files
* Add --static-location argument to set location of js/css files
* Improve journald integration
* Force job-output to be on top of the html report
* Do not write empty report
* Improve tokenizer
* Fix download-logs CLI

0.1.1
-----

* Fix new zuul-web api routes
* Fix typo in zuul-jobs role
* Fix incomplete journald entries
* Fix distance of duplicate lines
* Add journald support
* Command line interface refactor and filters fix
* Look for baseline in periodic, gate and check pipeline by default

0.1.0
-----

* CLI refactor and add zuul-jobs role
* Add hashing\_nn model and use it by default
* Add sub-command to command line interface
* Add lzma support
* Add KeyboardInterrupt exception handler

0.0.1
-----

* Add release job
* Add model info to the html output
* Move the file processing logic out of the cmd module
* Add zuul.yaml
* Improve reduction for tripleo-ci analysis
* Only display the relative paths and link to the original files
* Move setup\_logging to utils
* Improve HTML output
* Improve file grouping
* Refactor to better isolate the model from log processing
* Update README
* Add process-ci-events.py script
* Cleanup the code and remove the networking code
* Update README
* Fix direct download of gzip file
* Improvements for tripleo-ci logs processing
* Add a simple model and use it by default
* Add --model argument to test different strategies
* Fixed the too many opened files error
* Prettify html output using bootstrap
* Improve logs mirror support
* Do not match filename when there is only one baseline file
* Fix jenkins job info download
* Fix filename2modelname
* Move artifacts mirror code to utils
* Ignore DataDimensionalityWarning
* Rename --max-distance in --threshold + output improvements
* Rename --max-distance in --threshold + output improvements
* Strip comments starting by #
* Add support for console.html file
* Add html output
* Ignore dib\_build path
* Add after-context option
* Improve reporting
* Fix typo and cmd with target
* Import of loganamoly code
