Skip to content

v1.46 Autumn Additions

Latest
Compare
Choose a tag to compare
@stijn-uva stijn-uva released this 14 Oct 10:51
· 7 commits to master since this release

⚠️ Please follow these instructions for upgrading if you have trouble upgrading to the latest version of 4CAT. ⚠️

We also recommend reading the instructions above if you are running 4CAT via Docker and have in any way deviated from the default set-up, or see error messages in the log file when upgrading via the web interface.

Otherwise, you can upgrade 4CAT via the 'Restart or upgrade' button in the Control Panel. This release of 4CAT incorporates the following fixes and improvements:

  • Added support for extensions, modular additions to 4CAT that can be put in the /extensions/ folder in the 4CAT root (#451)
  • Added a processor to download 4CAT datasets as a Zip file, and updated the 'Import dataset' data source to allow loading these zip files as new datasets (#452)
  • Added a data source for Threads, to allow importing Threads data via Zeeschuimer (a68f5d6)
  • Added a processor for LLM-powered text coding via the DMI Service Manager (693960f)
  • Added an option to the Telegram data source to crawl based on mentions and links in addition to forwarded messages (8f2193c)
  • Added razdel as a tokeniser to the Tokenise processor for tokenising Russian text (0b74569)
  • Added an option to the 'Word trees' processor to allow selecting which column(s) to read text from (e4c0099)
  • Added more stopwords corpora to the Tokeniser and allow using multiple at the same time - by default the one for the chosen text language is used (b9a327a)
  • Added more 'auto-fill' options when importing CSV files (empty values, or the current date and time) (9bd9da5)
  • Added a warning to the 'Media upload' data source when trying to upload too many files at once (ffcb6a4, e4f982b, e304649)
  • Added more indicative dataset status updates when running DMI Service Manager-powered processors (eb76937)
  • Added support for previewing HTML datasets in the web interface (203314e)
  • Added configuration settings to toggle display of Anonymisation controls on the 'Create dataset' page (0945d8c)
  • Added configuration setting to toggle display of the 'you can install 4CAT yourself' message in the login form (cd356f7)
  • Added a feed of the official 4CAT BlueSky account to the 4CAT 'Home' page (3d94b66)
  • Added a delay to the worker that cleans up expired and orphaned datasets to wait 7 days before actually deleting an orphaned dataset (bfaf23b)
  • Fix a crash in the 'Image category wall' processor (ebf39d8)
  • Fix a crash in the 'Google Vision API' processor when running it on an empty dataset (fb09162)
  • Fix a crash in the 'Video hashes' processor when running it on a dataset with no .metadata.json file (d41fa34)
  • Fix a crash in the 'Download images' processor when trying to download images from a malformed URL (579ff64)
  • Fix a crash in the 'Download videos' processor when trying to extract video URLs from a non-text data field (e9b5232)
  • Fix a crash in the 'Hatebase' processor (4ba872b)
  • Fix a rare race condition when running 4CAT via Docker (#396)
  • Fix an issue in the front-end where an incomplete list of available processors was shown in some situations (4323946)
  • Fix an issue in the Telegram data source where it would indicate that the 'app' needs updating to log in (d2a787e, 346150b)
  • Fix an issue in the Telegram data source where crawl depth parameters would not be interpreted correctly (1c0bf5e, #444)
  • Fix an issue in the Telegram data source where some post attributes were not read correctly (2c8c860, 959710a, c67a046)
  • Fixed an issue where the link to a newly created dataset on the 'Create dataset' page would not always work (b542ded)
  • Fixed an issue where configuration tags with no associated users could get deleted (d6064be)
  • Fix an issue in the LinkedIn data source where image URLs would not always be parsed correctly (c27fbbe)
  • Fix an issue in the Douyin data source where stream URLs would not always be parsed correctly (d769be4)
  • Remove Spacy-powered text analysis processors (48c20c2)
  • Remove the Parler data source (ee7f434)
  • Update dependences (#450, a269f96, d2a787e)

Full Changelog: v1.45...v1.46