# Configuration options

ezPAARSE comes with a config.json file (located at the root of the application directory) that contains the default configuration. You can override it by creating a config.local.json file with your own settings. Beware that modifying the config.json file may prevent ezPAARSE from updating correctly.

# EZPAARSE_APP_NAME

ezPAARSE display name. As of now, this is only used in email subjects.

# EZPAARSE_ADMIN_MAIL

The default value is set to ezpaarse@couperin.org

# EZPAARSE_HOSTNAME

Domain name of the ezPAARSE instance.

# EZPAARSE_PARENT_URL

To avoid the setup of a local SMTP server, you can delegate the management of user feedback (via the online form) to another ezPAARSE instance (called a "parent" instance). The default value is set to http://ezpaarse-preprod.couperin.org

# EZPAARSE_SMTP_SERVER

If you want to use a specific SMTP server to send emails, set the value to a JSON object that is compatible with nodemailer options (opens new window).

# Example

{
  "EZPAARSE_SMTP_SERVER": {
    "host": "smtp.intra.org",
    "port": 25
  }
}

# EZPAARSE_FEEDBACK_RECIPIENTS

The mail adress where the users feedback get sent. The default value is set to ezpaarse@couperin.org

# EZPAARSE_SUBSCRIPTION_MAIL

If you wish to receive a message everytime a user opens an account on your instance, set the value to true. The default value is set to false

# EZPAARSE_MONGO_URL

The default value is set to mongodb://localhost:27017/ezpaarse

# EZPAARSE_ENV

The default value is set to production

# EZPAARSE_NODEJS_PORT

The default value is set to 59599

# EZPAARSE_NODEJS_VERSION

The default value is set to 14.17.6

# DEFAULT_LOCALE

The default value is set to fr

# EZPAARSE_OUTPUT_FIELDS

Contains an array of field names that are going to be present in the result file produced by ezPAARSE. The default array contains the following fields:

[
"datetime",
"date",
"login",
"platform",
"platform_name",
"publisher_name",
"rtype",
"mime",
"print_identifier",
"online_identifier",
"title_id",
"doi",
"publication_title",
"unitid",
"domain"
]

# EZPAARSE_DEMO

If true, it shows a warning informing users that the instance is a demo, and thus not adapted to process large log files. This warning now appears on our demo instance hosted on http://ezpaarse.org (opens new window) The default value is set to false.

# EZPAARSE_DEFAULT_HEADERS

An object representing default headers to be used for each job. Can be overriden by predefined settings and actual job headers.

# Example

{
  "EZPAARSE_DEFAULT_HEADERS": {
    "Crypting-Salt": "OU0qTpLOmC"
  }
}

# EZPAARSE_MIDDLEWARES"

Contains an array of middleware names, in the order they are going to be launched by ezPAARSE during a process. The default array contains the following middlewares:

[
  "filter",
  "parser",
  "deduplicator",
  "enhancer",
  "istex",
  "crossref",
  "sudoc",
  "hal",
  "geolocalizer",
  "field-splitter",
  "qualifier",
  "cut",
  "anonymizer"
]

# EZPAARSE_QUALIFYING_LEVEL

This sets the minimal value, under which ezPAARSE considers an EC is not qualified enough to be written to the results. The default value is set to 1

# EZPAARSE_QUALIFYING_FACTORS

{
    "internal": {
      "rtype": 0.5,
      "mime": 0.5
    },
    "external": [
      {
        "file": "platforms/fields.json",
        "sublist": "rid",
        "attribute": "code",
        "weight": 1
      }
    ]
  }

# EZMESURE_INSTANCES

[v3.8.0 and above]

The list of ezMESURE instances that can be used for uploading ezPAARSE results. Each instance should have an arbitrary id, a label and a baseUrl. An optional options object can be provided, which can be used to add headers and query parameters when uploading files, or disable certificates verification by setting strictSSL to false.

Example:

[
  { "id": "prod", "label": "ezMESURE", "baseUrl": "https://ezmesure.couperin.org" },
  { "id": "integ", "label": "ezMESURE - Préproduction", "baseUrl": "https://ezmesure-preprod.couperin.org" },
  {
    "id": "dev",
    "label": "ezMESURE - Local developpement instance",
    "baseUrl": "https://localhost",
    "options": {
      "headers": { "Split-Fields": "ezproxy-groups(+)" },
      "qs": { "store": false },
      "strictSSL": false
    }
  }
]

# EZPAARSE_TMP_CYCLE

Determines how long ezPAARSE results remain accessible for downloading. The default value is set to 60min

# EZPAARSE_TMP_LIFETIME

Sets the maximal duration for the storage of result files. The default value is set to 1day

# EZPAARSE_IGNORED_DOMAINS

Contains an array of domains to be ignored (ie filtered out) by ezPAARSE.

[
  "www.google.fr",
  "www.google.com"
]

To avoid declaring too long a list, we advise you to declare unrelevant domains in dedicated exclusion files as documented in this section (opens new window)

# EZPAARSE_GEOLOCALIZE_DEFAULT

The default value is set to geoip-lookup

# EZPAARSE_GEOLOCALIZE_SEPARATOR

The default value is set to . (dot)

# EZPAARSE_ALERTS

Contains an object with 2 member properties, listed here:

{
  "activationThreshold": 1000,
  "unknownDomainsRate": 10
}

The properties and their values are documented in further details in the relevant section (opens new window)