bulkmarcimport.pl - Import bibliographic/authority records into Koha
$ export KOHA_CONF=/etc/koha.conf $ perl misc/migration_tools/bulkmarcimport.pl -d -commit 1000 \\ -file /home/jmf/koha.mrc -n 3000
Don't use this script before you've entered and checked your MARC parameters tables twice (or more!). Otherwise, the import won't work correctly and you will get invalid data.
This version/help screen
Type of import: bibliographic records
Type of import: authority records
The FILE to import
Verbose mode. 1 means "some infos", 2 means "MARC dumping"
Turn off foreign key checks during import.
The NUMBER of records to import. If missing, all the file is imported
File offset before importing, ie NUMBER of records to skip.
The NUMBER of records to wait before performing a 'commit' operation
File logs actions done for each record and their status into file
Test mode: parses the file, saying what he would do, but doing nothing.
Skip automatic conversion of MARC-8 to UTF-8. This option is provided for debugging.
The CHARACTERISTIC MARC flavour. At the moment, only MARC21 and UNIMARC are supported. MARC21 by default.
Delete EVERYTHING related to biblio in koha-DB before import. Tables: biblio, biblioitems, items
Input file FORMAT: MARCXML or ISO2709 (defaults to ISO2709)
Field store ids in FIELD (usefull for authorities, where 001 contains the authid for Koha, that can contain a very valuable info for authorities coming from LOC or BNF. useless for biblios probably)
FIELD matchindex,fieldtomatch matchpoint to use to deduplicate fieldtomatch can be either 001 to 999 or field and list of subfields as such 100abcde
If set, a search will be done on isbn, and, if the same isbn is found, the biblio is not added. It's another method to deduplicate. -match & -isbn can be both set.
Clean ISBN fields from entering biblio records, ie removes hyphens. By default, ISBN are cleaned. --nocleanisbn will keep ISBN unchanged.
Source bib TAG for reporting the source bib number
Source SUBFIELD for reporting the source bib number
FILE for the koha bib and source id
Store ids in 009 (usefull for authorities, where 001 contains the authid for Koha, that can contain a very valuable info for authorities coming from LOC or BNF. useless for biblios probably)