IsIdentifiable: A tool for detecting identifiable information in data sources

Thomas Nind (Developer), Andrew Brooks (Developer), James Sutherland (Developer), Ruairidh MacLeod (Developer)

Research output: Non-textual formSoftware

Abstract

A tool for detecting identifiable information in data sources. Out of the box supports:

- CSV
- DICOM
- Relational Database Tables (Sql Server, MySql, Postgres, Oracle)
- MongoDb

Rules base is driven by regular expressions and plugin services (e.g. Natural Language Processing). Also includes a reviewer/redactor tool for processing false positives and updating the rules base.
There is a standalone command line tool called ii for running directly or you can use the nuget package in your own code to evaluate data.
Original languageEnglish
Place of PublicationUnited States
PublisherGitHub
Media of outputOther
SizeSoftware
Publication statusPublished - 2 Aug 2022

Keywords

  • NLP
  • csharp
  • anonymisation
  • natural language processing
  • tool
  • library
  • nuget

Fingerprint

Dive into the research topics of 'IsIdentifiable: A tool for detecting identifiable information in data sources'. Together they form a unique fingerprint.

Cite this