Reprocessing Utility

Creation date: 9/27/2022 5:06 PM    Updated: 1/24/2023 7:10 PM
Introduced in Angelfish v2.44, the Reprocessing Utility makes it easy for Angelfish Admins to reprocess a Profile's data for a specified date range, without having to update the Datasources attached to the Profile.

Occasionally something happens that causes bad data in a Profile's reports, like:
  • Incorrect Profile or Filter settings
  • Web server changes (e.g. new log format)
  • Mandatory password updates (see Service Accounts)
  • Unexpected data in the logs

You can usually remove bad report data with the "Delete Options: Specific Visits" function in the Run/Data Management tab of each Profile...but this solution only deletes data from the reports.  The Reprocessing Utility is the thing to use if you need to add data to the reports, i.e. reprocess your log files.


REQUIREMENTS


In order to use the Reprocessing Utility, you need two things:
  • a date in the log file name
  • a valid entry in each Datasource's "File Naming Convention" field

1) Date in Log File Name

Most logs files contain a date stamp in the file name that indicates the date of the data contained in the logs.  IIS web servers create a daily log file with a date stamp in the file name - it looks like this:
  • u_ex221031.log

The date stamp in this file name is in YYMMDD format and indicates the log contains data from October 31, 2022.

If you use Apache / Tomcat / another web server, you can configure the server's logging feature to insert a date stamp to the log file name.  Please refer to your web server documentation for instructions on how to do this.

2) File Naming Convention

Each Datasource attached to the Profile needs a valid entry in the "File Naming Convention" field.  Angelfish uses the contents of this field to determine which logs need to be re-read.

The File Naming Convention field uses Date Substitution characters to specify the location of the date stamp in the log file names.  You can use wildcards in this field as well.


IIS Example:
  • Original file name:  u_ex221031.log
  • File Naming Convention: u_exYYMMDD.log

Apache Example:
  • Original file name:  access.log.2022-10-31.gz
  • File Naming Convention:   access.log.YYYY-MM-DD.gz

You can specify a 2 or 4 digit year (YY or YYYY), 2 digit month (MM), and 2 digit day (DD).


USING THE UTILITY


Click the Start Date & End Date fields to pop open a calendar, and select the relevant date range.  

We recommend leaving "enable log tracking" and "delete data before reprocessing" checked (enabled), unless you have a specific reason to uncheck either.  If you have questions, please open a support ticket.