NAME
Validate::CodiceFiscale - Validate an Italian "Codice Fiscale"
VERSION
This document describes Validate::CodiceFiscale version 0.004.
SYNOPSIS
One-liner:
$ perl -MValidate::CodiceFiscale=r -er RSSMRA98S03B833G
Module usage:
use Validate::CodiceFiscale qw< assert_valid_cf is_valid_cf validate_cf >;
my $valid = 'RSSMRA98S03B833G';
my $invalid = 'RSICRL99C51C967X';
# the first does not warn, the second does warn
eval { assert_valid_cf($valid); 1 } or warn "died: $@";
eval { assert_valid_cf($invalid); 1 } or warn "died: $@";
# plain boolean test, any error short-circuits
if (is_valid_cf($cf)) { ... }
# get everything that's wrong
if (my $errors = validate_cf($invalid)) {
say for $errors->@*; # array with error report, one line per error
}
# it's possible to pass additional validation options, like specific
# data. All are optional; if present they're validated, otherwise
# ignored.
assert_valid_cf($cf,
{
data => {
name => 'Foo',
surname => 'Bar',
sex => 'f',
date => '1998-03-11',
place => 'B833',
}
}
);
# the assertion short-circuits by default, failing at the first
# error. It's possible to check everything and get a longer error
# message, if needed.
assert_valid_cf($cf, { all_errors => 1 });
# it's also possible to wrap the error generation, by returning the
# exception to throw
assert_valid_cf($cf,
{
all_errors => 1,
on_error => sub {
my @errors = @_;
my $n_errors = scalar @errors;
return "number of errors: $n_errors\n";
}
}
);
# of course, it's possible to throw the exception directly
use Ouch;
assert_valid_cf($cf, { on_error => sub { ouch 400, $_[0] } });
DESCRIPTION
This module performs partial validation of Italian Codice Fiscale,
mainly at the syntactic level. For proper and legally acknowledged
validation please refer to the official service by Agenzia delle
Entrate at
https://telematici.agenziaentrate.gov.it/VerificaCF/Scegli.do?parameter=verificaCf.
At the basic level, it allows spotting common errors that might come
from copying/typing the Codice Fiscale, e.g. typing a 0 (zero) instead
of an uppercase letter O, or similar errors with other digits/letters.
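As an illustration of how such typing errors can be caught, the public rules for the Codice Fiscale define the last character as a control character computed from the first 15. The following is a minimal standalone sketch of that computation; it does not use this module and is not meant to describe its internals. The helper names expected_check_char and check_char_ok are made up for the example.

```perl
use strict;
use warnings;

# Values for characters in odd positions (1st, 3rd, ...), counting from 1.
my %ODD;
@ODD{ 0 .. 9 }     = (1, 0, 5, 7, 9, 13, 15, 17, 19, 21);
@ODD{ 'A' .. 'Z' } = (
   1,  0,  5,  7,  9,  13, 15, 17, 19, 21,    # A-J
   2,  4,  18, 20, 11, 3,  6,  8,  12, 14,    # K-T
   16, 10, 22, 25, 24, 23,                    # U-Z
);

# Hypothetical helper: control character expected for the first 15 characters.
sub expected_check_char {
   my ($cf) = @_;
   my @chars = split //, uc substr($cf, 0, 15);
   my $sum = 0;
   for my $i (0 .. $#chars) {
      my $c = $chars[$i];
      if ($i % 2 == 0) {    # odd position when counting from 1
         $sum += $ODD{$c};
      }
      else {                # even position: digit value, or A = 0 .. Z = 25
         $sum += $c =~ /[0-9]/ ? $c : ord($c) - ord('A');
      }
   }
   return chr(ord('A') + $sum % 26);
}

# Hypothetical helper: true if the 16th character matches the expected one.
sub check_char_ok {
   my ($cf) = @_;
   return 0 unless defined $cf && $cf =~ /\A[A-Za-z0-9]{16}\z/;
   return uc(substr $cf, 15, 1) eq expected_check_char($cf) ? 1 : 0;
}

print check_char_ok('RSSMRA98S03B833G') ? "ok\n" : "mismatch\n";
print check_char_ok('RSSMRA98SO3B833G') ? "ok\n" : "mismatch\n";    # O typed for 0
```

With the codes from the SYNOPSIS, the first one passes this check, while RSICRL99C51C967X does not (the computed control character is W, not X).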
The validation can optionally take additional data regarding the name,
surname, birth date, birth place, and sex to perform a more thorough
validation based on the official rules. This part is subject to false
validations (negative or positive) for the following reasons:
* Places are generally validated against available data collected
from ANPR and ISTAT, but they might be lacking. Additionally, there is
no attempt at matching similar names, or names that are valid but
e.g. expressed in their original language.
* The algorithm to generate the Codice Fiscale might lead to the same
string for different people. This is accounted for by using a
substitution of digits with letters, but only Agenzia delle Entrate
can tell whether the specific person holds the specific variant of
the code.
In the end, it provides a reasonable way of spotting negatives, but it
might validate codes that would otherwise be considered invalid by the
reference authority because there is nobody tied to the otherwise
syntactically valid code.
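The substitution of digits with letters mentioned above follows a fixed, publicly documented table (0 to L, 1 to M, 2 to N, 3 to P, 4 to Q, 5 to R, 6 to S, 7 to T, 8 to U, 9 to V), applied to the seven positions that normally hold digits. A minimal standalone sketch of mapping such letters back to digits follows; restore_digits is a hypothetical helper for illustration, not part of this module's interface.

```perl
use strict;
use warnings;

# 0-based indexes of the seven characters that normally hold digits:
# year (2), day (2) and the numeric part of the place code (3).
my @DIGIT_POSITIONS = (6, 7, 9, 10, 12, 13, 14);

# Letters that may stand in for the digits 0 .. 9, in this order.
my %LETTER_TO_DIGIT;
@LETTER_TO_DIGIT{qw< L M N P Q R S T U V >} = (0 .. 9);

# Hypothetical helper: put digits back where letters were substituted.
# The control character (last one) is left untouched, because it is
# computed on the variant actually issued, not on the restored string.
sub restore_digits {
   my ($cf) = @_;
   my @chars = split //, uc $cf;
   for my $i (@DIGIT_POSITIONS) {
      next unless $i < @chars;
      my $c = $chars[$i];
      $chars[$i] = $LETTER_TO_DIGIT{$c} if exists $LETTER_TO_DIGIT{$c};
   }
   return join '', @chars;
}

print restore_digits('RSSMRA98S03B83PX'), "\n";
```

The restored string is only useful for reading the date and place portions; whether a given variant was really assigned to a person remains something only Agenzia delle Entrate can tell.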
INTERFACE
There are four main functions for doing checks, each targeting a
different style of use, plus an additional function that can be helpful
for one-liners.
assert_valid_cf
assert_valid_cf($cf, %options);
Check validity of the provided $cf and throw an exception if the check
fails.
By default, the exception is thrown:
* using Carp's croak with a message containing a string joining all
errors
* as soon as one of the validation checks fails.
Supported options:
all_errors
collect all errors to be fed into the exception, not only the first
one.
data
pass additional data for validation, as a hash reference optionally
containing keys name, surname, date, sex, and place.
on_error
wrap the exception generation with a sub reference used as a callback
with the following signature:
sub (@error_strings) { ... }
The sub can throw the exception itself; otherwise, its return value
will be used as the argument for die.
decode_cf
my $decoded = decode_cf($cf, %options);
Perform a full validation and return a data structure with the
parsed data and the outcome of all available tests. This is the most
complete form of validation that can be performed by this module.
Supported options:
data
pass additional data for validation, as a hash reference optionally
containing keys name, surname, date, sex, and place.
years_baseline
the baseline year for disambiguating two-digit years. As an example,
people born in 1920 and people born in 2020 both have 20 as the year
part in their respective codes, so this allows setting the right
expectation. If this parameter is missing, the most recent matching
year is selected (in the example above, 20 would be interpreted as 2020).
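To make the two-digit ambiguity concrete, here is a minimal standalone sketch of one plausible reading of a baseline: the expanded year is the most recent one that ends with the given two digits and is not later than the baseline. This is an assumption made for illustration; the exact semantics this module applies to years_baseline may differ, and expand_year is a hypothetical helper.

```perl
use strict;
use warnings;

# Hypothetical helper: expand a two-digit year to four digits.
# Assumption: pick the latest year ending in $yy that is <= $baseline;
# without a baseline, the current year is used.
sub expand_year {
   my ($yy, $baseline) = @_;
   $baseline //= (localtime)[5] + 1900;
   my $year = $baseline - ($baseline % 100) + $yy;
   $year -= 100 if $year > $baseline;
   return $year;
}

print expand_year(20, 2023), "\n";    # latest year ending in 20 up to 2023
print expand_year(20, 1950), "\n";    # latest year ending in 20 up to 1950
```

Under this reading, a baseline of 2023 turns 20 into 2020, while a baseline of 1950 turns it into 1920.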
The output data structure contains the following keys:
errors
pointing to an array reference with a list of validation errors (if
any);
portions
pointing to a hash reference with the input split into its component
portions, if possible (it is always possible as long as the input
length is 16 characters).
date
as YYYY-MM-DD
place
as string of characters
sex
as a single-letter character, either F or M
year
month
day
as integers.
The decoded data can be useful to perform further validations, e.g. for
validating the place name in cases that are not covered by this module
(e.g. when expressed in a language different from Italian, etc.).
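For readers who want to see where the date and sex come from, the following standalone sketch reads them straight from the code according to the public encoding rules: characters 7-8 hold the two-digit year, character 9 a month letter, and characters 10-11 the day, increased by 40 for women. It does not call decode_cf, and the helper name and the keys of the returned hash are made up for the example; letters substituted for digits are not handled.

```perl
use strict;
use warnings;

# Month letters, January to December.
my %MONTH_OF;
@MONTH_OF{qw< A B C D E H L M P R S T >} = (1 .. 12);

# Hypothetical helper: extract two-digit year, month, day and sex.
# Returns nothing if the relevant portion does not look well-formed.
sub decode_birth {
   my ($cf) = @_;
   my ($yy, $m, $dd) = uc($cf) =~ /\A[A-Z]{6}([0-9]{2})([A-Z])([0-9]{2})/
      or return;
   my $month = $MONTH_OF{$m} or return;
   my $sex = $dd > 40 ? 'F' : 'M';
   my $day = $dd > 40 ? $dd - 40 : $dd + 0;
   return { yy => $yy + 0, month => $month, day => $day, sex => $sex };
}

my $birth = decode_birth('RSSMRA98S03B833G');
printf "%02d-%02d-%02d %s\n", @{$birth}{qw< yy month day sex >};
```

The sketch does not check that the day is valid for the month, and it leaves the century undecided, which is where a baseline year comes into play.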
is_valid_cf
my $boolean = is_valid_cf($cf, %options);
Check the validity; it short-circuits at the first error. Returns a
boolean value.
It's possible to pass an additional key/value pair with key data and a
hash reference optionally containing keys name, surname, date, sex, and
place.
r
$ perl -MValidate::CodiceFiscale=r -er RSSMRA98S03B833G
The r function can be useful in one-liners to check one or more codes
from the command line. By default, the r function will read input codes
from @ARGV.
validate_cf
my $errors = validate_cf($cf, %options);
Check the validity and return undef if no errors were encountered, or a
reference to an array containing the list of errors otherwise.
Supported options:
all_errors
collect all errors in the returned array, not only the first one.
This option defaults to a true value, to collect all errors; it's
possible to pass a false value to short-circuit and exit at the
first error.
data
pass additional data for validation, as a hash reference optionally
containing keys name, surname, date, sex, and place.
years_baseline
the baseline year for disambiguating two-digit years. As an example,
people born in 1920 and people born in 2020 both have 20 as the year
part in their respective codes, so this allows setting the right
expectation. If this parameter is missing, the most recent matching
year is selected (in the example above, 20 would be interpreted as 2020).
SEE ALSO
String::CodiceFiscale is an earlier, alternative module to deal with
Codice Fiscale. I'm a bit scared about using a class method error to
collect validation errors, hence this module.
BUGS AND LIMITATIONS
Minimum perl version 5.24.
Report bugs through GitHub (patches welcome) at
https://github.com/polettix/Validate-CodiceFiscale.
AUTHOR
Flavio Poletti
COPYRIGHT AND LICENSE
This distribution is mainly composed of code, but it also includes data
(in the __DATA__ section of the module) that is derived from publicly
available data sources.
Code
Copyright 2023 by Flavio Poletti
Licensed under the Apache License, Version 2.0 (the "License"); you may
not use this file except in compliance with the License. You may obtain
a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or
implied. See the License for the specific language governing
permissions and limitations under the License.
Data
The data contained in the __DATA__ section of the main module is
assembled from the following sources:
* Archive of Italian codes for places, including periods of validity,
from file ANPR_archivio_comuni.csv, available from Italian Ministero
dell'Interno with license CC-BY 4.0 (according to page Note legali as
of 2023-08-06). The file is accessible from page Archivio storico dei
comuni.
* Archive of codes for foreign places, from file Elenco codici e
denominazioni delle unità territoriali estere, available from ISTAT
with license CC-BY 3.0 (according to page Note legali: responsabilità
e licenza as of 2023-08-06). The file is accessible from page Codici
delle unità territoriali estere.