WARNING: C Toolkit is deprecated, please use C++ Toolkit instead


Warning, /config/README is written in an unsupported language. File is not indexed.


Beginning with Release 2.0 of the Entrez:Sequences and Pre-Release 1.0 of the
Entrez:MEDLINE CD-ROMs, it is possible for the Entrez application to access
one or both data sets. This will allow users to potentially have access to the
rich set of MEDLINE articles on the Entrez:MEDLINE disc, while simultaneously
having access to the sequence data on the Entrez:Sequences disc.

This functionality requires both
 i) a new version of the Entrez application software, and
ii) a more complex configuration file, to handle the wide variety of
    possible configurations.

Users who wish to use only _one_ of the Entrez:Sequences or Entrez:MEDLINE discs
(never in combination) will not need to make any changes in their configuration
files; the new Entrez software is backwards compatible with these old
configuration files. If you fall into this catagory and have already installed
Entrez on your machine, then there is no need to read further.

Note that the complex configuration mechanism which is discussed below is
not currently supported. We would, however, appreciating hearing about any
problems which you may encounter. A supported version with a user-friendly
configuration program will be provided with Release 3.0 of the Entrez:Sequences
CD-ROM.

Also note that, because there will be no pre-Release 2.0 Entrez:MEDLINE CD-ROM,
there will be a few MEDLINE abstracts which will be present on the latest
Entrez:Sequences CD-ROM (Release 2.0), but not on the latest Entrez:MEDLINE
CD-ROM (Pre-Release 1.0). This is a deviation from future CD-ROM releases,
when the MEDLINE records on the latest Entrez:Sequences CD-ROM will be a
proper subset of those sequences on the latest Entrez:MEDLINE CD-ROM.

Users who wish to use both CD-ROMs on a single CD-ROM drive are advised to
make sure that they have two CD-ROM caddies available (one for each CD).
Frequent switching of CD-ROMs between a single caddy and the CD jewel boxes can
induce high levels of stress.

Seven pre-canned configuration files are made available, with corresponding
.ini and .cnf files for MS Windows and the Macintosh, respectively. The
selected file should be modified as appropriate for your machine, renamed
to NCBI.INI or ncbi.cnf, and used to replace the NCBI.INI or ncbi.cnf file, as
outlined on the Entrez manual's installation instructions.


The pre-canned configuration files are as follows:

    NCBISM1D.XXX      MEDLINE and Sequence CDs, with only one CD-ROM drive
    NCBISM2D.XXX      MEDLINE and Sequence CDs, with two CD-ROM drives
    NCBIMO.XXX        MEDLINE CD only
    NCBISO.XXX        Sequence CD only
    NCBISHMC.XXX      Harddisk-based Sequence CD image, and MEDLINE CD-ROM
    NCBISCMH.XXX      Harddisk-based MEDLINE CD image, and Sequence CD-ROM
    NCBISHMH.XXX      MEDLINE CD and Sequence CD images, both on hard disk

The following customizations may be necessary for your machine:

* For all files, the only changes to be made will be within the first 20-or-so
  lines of the configuration file, within the "NCBI" section and the media
  sections "ENTREZ_xxx_yD" (where xxx is one of "SEQ" or "MED", and y is one of
  "C" or "H").

* A copy of the CDROMDAT.VAL for each CD-ROM to be used must be stored on the
  hard disk, in the directory pointed to by the "VAL" field for the
  corresponding media. Note that these files are _different_ for the Sequence
  and MEDLINE CD-ROMs, and the correct file must be stored in each prescribed
  location.

* For improved performance, a copy of the index files should be copied onto
  the hard disk for each CD-ROM to be used, if space is available. These
  canned configuration files assume the availability of such index files on a
  hard disk. If you choose not to install the index files, then remove the
  "IDX=" lines from the configuration file which you have selected. Again,
  note that the index files are _different_ for the Sequence and MEDLINE
  CD-ROMs, and the correct files must be stored in each prescribed location.

* For MS-Windows, it is assumed that the first CD-ROM drive is drive D and the
  second CD-ROM drive (NCBISM2D.INI only) is drive E. Change this as necessary.

* For Macintosh systems, the hard disk name yourHardDisk should be changed to
  the name of your hard disk.

* For both types of systems, it is assumed that all hard disk files reside on
  the same hard disk. Change this as necessary.

* If you choose to copy both CD-ROMs to your hard disk (NCBISHMH.XXX), then
  you need not copy the medline directory from the Entrez:Sequences CD-ROM
  onto your hard disk, since this portion of MEDLINE is a proper subset of
  the MEDLINE on the Entrez:MEDLINE CD-ROM.

* Note that a copy of the appropriate seven configuration files for your
  platform (Mac or Windows) will be automatically copied by the installation
  procedure into the ENTREZ\CONFIG folder. You may, however, find an alternate
  copy of the configuration files on the CD-ROM in the SOFTWARE\CONFIG folder,
  inside the MAC and WIN folders.

* Note that the "ROOT=" field in the "[NCBI]" section of the multi-source
  configuration files (ncbi????.XXX) is not used by this version of Entrez,
  but is provided for backwards compatability with older versions of Entrez,
  as well as for compatability with other applications using the older
  version of our data access libraries.

* Because the MEDLINE entries on Release 2.0 of the Entrez:Sequences CD-ROM
  are not a proper subset of the entries on pre-Release 1.0 of the
  Entrez:MEDLINE CD-ROM, it may be necessary to set an additional configuration
  parameter, if you begin to encounter "Missing UID" errors when running
  Entrez. This is the only parameter which should be set in the
  entrez.[cnf/ini] configuration file; all other parameters should be
  set in the ncbi.[cnf/ini] configuration file. The parameter which should
  be set is "SHOWALLERRORS=FALSE", in the [PREFERENCES] section. It is
  strongly recommended that you first get your configuration of Entrez
  running properly, before adding this line to your entrez.[cnf/ini] file.
  This configuration option will mute many errors, which may make it
  difficult to debug your configuration difficulties.
  


EXAMPLE

You wish to use both CD-ROMs, on a single CD-ROM drive, under MS Windows.
Suppose that the device for your CD-ROM drive is named "F:", not "D:"

Install Entrez from Release 2.0 of the Entrez:Sequences CD-ROM, per the
installation/update instructions in the manual.

Make a copy of ncbism1d.ini from \ENTREZ\CONFIG\NCBISM1D.INI, first having
saved a copy of NCBI.INI (if you had one):

    COPY C:\WIN\NCBI.INI C:\WIN\NCBIINI.BAK
    COPY C:\ENTREZ\CONFIG\NCBISM1D.INI C:\WIN\NCBI.INI

Edit C:\WIN\NCBI.INI with your favorite editor, and change the occurrences
of ROOT=D:\ to ROOT=F:\. Save your changes and exit the editor.

Create some directories, if they don't already exist:
    MKDIR C:\ENTREZ\MED
    MKDIR C:\ENTREZ\MED\INDEX
    MKDIR C:\ENTREZ\SEQ
    MKDIR C:\ENTREZ\SEQ\INDEX

Copy the sequence index files, and CDROMDAT.VAL to your hard disk.
    COPY F:\INDEX\*.* C:\ENTREZ\SEQ\INDEX
    COPY F:\CDROMDAT.VAL C:\ENTREZ\SEQ

Now, eject the Entrez:Sequences CD, insert the Entrez:MEDLINE CD, and
copy the MEDLINE index files and CDROMDAT.VAL to your hard disk.
    COPY F:\INDEX\*.* C:\ENTREZ\MED\INDEX
    COPY F:\CDROMDAT.VAL C:\ENTREZ\MED

Now, start-up Windows (if it's not already running), and launch Entrez.
It doesn't matter which CD-ROM, if any, is inserted into the CD-ROM drive
(although, for convenience, it generally makes sense to insert the CD-ROM
which you would like to use first). The Entrez application will inform
you when it is time to insert the other CD-ROM.

NOTES

When using two CD-ROMs on a single CD-ROM drive, the Macintosh version will
automatically eject the CD-ROM which is currently inserted. Ejection
must be performed manually for the Microsoft Windows version.

Ejecting a CD-ROM at times other than that directed by the Entrez application
may result in undesirable effects.

If you _must_ eject a CD-ROM on the Macintosh when Entrez is running, it
is important to drag the CD-ROM icon to the trash can, rather than using
the Eject selection from the Finder's FILE menu. The latter may result in 
undesirable effects.



The remainder of this document is a technical discussion which should not
be necessary for the reader who only wants to install Entrez on their system.


----------------------------------------------------------------------------

                           TECHNICAL DISCUSSION

The new configuration files consist of a three-level structure. This
hierarchy is implemented by using unique user-specified names for sections
within the configuration file, as well as some reserved section names.

The top-level of the hierarchy consists of three reserved-named sections,
"MEDLINE", "SEQUENCE", and "LINKS", each of which contain a single _field_,
"CHANNELS". Channels are used to specify the mechanisms by which the
corresponding types of data can be obtained. For example, considering the
Entrez:MEDLINE and Entrez:Sequences CDs, it is possible to obtain some
MEDLINE information from either CD, but Sequence information may only be
obtained from the Entrez:Sequences CD. Therefore, the value of the channels
field for MEDLINE will contain two user-defined channel names, but the value
of the channels field for SEQUENCE will only contain one such channel name.

Each name listed on the right-hand-side of "CHANNELS=" must corresponding
to a section-name at the second-level of the hierarchy; the "Channels"
level. Each Channel-level entry consists of a list of priorities for 
the possible types of data associated with that channel. Priorities are used
by the Entrez software to determine which Channel it should attempt to
use for obtaining the corresponding data. A priority of 0 indicates that
this channel should never be used to obtain this data. For positive values,
a higher priority indicates a preference for that data channel. For example,
the Channel for obtaining MEDLINE records from the Entrez:Sequences disc
might have priority 1, while the corresponding Channel associated with
the Entrez:MEDLINE disc might have priority 2, because the latter is a better
source for this data. The integer-valued priorities may optionally be
followed by a comma and the keyword "NO_DRASTIC_ACTION". This means that,
if the priority for this channel is higher than any other, but a "drastic
action" would need to be taken to make this channel active (like ejecting
a different CD-ROM), then a channel with lower priority may be deferred to
(e.g., if the corresponding CD-ROM is currently inserted).

The possible data types for MEDLINE and SEQUENCE channels are:
    RECORDS   - Entire MEDLINE abstracts, or Sequence entries; these
                corresponding to double-clicking on a document summary
                in the Documents window
    DOCSUMS   - A document summary; these appear as a scrolled list in
                the document summary window
    TERMS     - These are the terms specified in term selection in the
                Query window
    BOOLEANS  - This is the operation performed during query refinement
The default priority for an unreferenced data type (e.g., TERMS) is 1.

A LINKS channel consists of an "INFO" priority, used to access global
information about Entrez status, and a set of links relationships. Links
relationships are specified by the "from" name, followed by two underscores,
followed by the "to" name. For example, the name for MEDLINE to SEQUENCE
links is "MEDLINE__SEQUENCE". The default priority for these double-underscored
names is 0, while the default priority for "INFO" is 1.

Each Channel section must also contain a "MEDIA" field. This references
the lowest-level level in the hierarchy, Media.

A Media, in turn, contains a "TYPE" field (currently this value must be either
CD or HARDDISK), and a set of fields which correspond to much of the original
set of fields which appeared in the "NCBI" section of the old-style
configuration field (namely: IDX, ROOT, etc.). There is an additional field,
"VAL", which must point to the filename for CDROMDAT.VAL. The default value for
"VAL" is the value specified by "ROOT".

A media section must also contain the field "FORMAL_NAME", which is
the formal name to be used for that media when the software addresses the
user (e.g. "Entrez:MEDLINE CD-ROM"). A media section may also contain
one or more fields of the form "DRASTIC_TO_mmm=1", where mmm is the
section name of another media. This is used in conjunction with the
"NO_DRASTIC_ACTION" option which may appear in some Channel fields.
For example, within a Media section "ENTREZ_MED_CD", the field
"DRASTIC_TO_ENTREZ_SEQ_CD" means that it is considered to be a drastic
action to switch from the Entrez:MEDLINE CD-ROM to the Entrez:Sequences
CD-ROM.

The "NCBI" section  must contain the DATA and ASNLOAD entries, indicating
where the data files and ASN.1 object loader definitions are to be found.
In addition, the "NCBI" section must contain a "MEDIA" field, which is
a comma-separated list of all the Media which will be used. Note that
this constitutes a deviation from the 3-level model mentioned earlier.

A discussion of the pathname redirection used in both old and new-style
configuration files is in order here, since it has never been fully
documented on earlier CD-ROMs or CD-ROM documentation. The pathname
specification field names are: "ROOT", "IDX", "TRM", "MED", "SEQ", "LNK".
All pathnames default as being relative to the directory specified by
"ROOT", which is a mandatory field. The remaining fields, which are all
optional, override the pathname specified by "ROOT" for a specific set of files,
as follows:
* IDX    - Index files
* TRM    - Term list files (and their associated indices and "posting files")
* MED    - ASN.1 data for MEDLINE documents
* SEQ    - ASN.1 data for sequence documents
* LNK    - Links among related documents
* 
0001
0002
0003
0004
0005
0006
0007
0008
0009
0010
0011
0012
0013
0014
0015
0016
0017
0018
0019
0020
0021
0022
0023
0024
0025
0026
0027
0028
0029
0030
0031
0032
0033
0034
0035
0036
0037
0038
0039
0040
0041
0042
0043
0044
0045
0046
0047
0048
0049
0050
0051
0052
0053
0054
0055
0056
0057
0058
0059
0060
0061
0062
0063
0064
0065
0066
0067
0068
0069
0070
0071
0072
0073
0074
0075
0076
0077
0078
0079
0080
0081
0082
0083
0084
0085
0086
0087
0088
0089
0090
0091
0092
0093
0094
0095
0096
0097
0098
0099
0100
0101
0102
0103
0104
0105
0106
0107
0108
0109
0110
0111
0112
0113
0114
0115
0116
0117
0118
0119
0120
0121
0122
0123
0124
0125
0126
0127
0128
0129
0130
0131
0132
0133
0134
0135
0136
0137
0138
0139
0140
0141
0142
0143
0144
0145
0146
0147
0148
0149
0150
0151
0152
0153
0154
0155
0156
0157
0158
0159
0160
0161
0162
0163
0164
0165
0166
0167
0168
0169
0170
0171
0172
0173
0174
0175
0176
0177
0178
0179
0180
0181
0182
0183
0184
0185
0186
0187
0188
0189
0190
0191
0192
0193
0194
0195
0196
0197
0198
0199
0200
0201
0202
0203
0204
0205
0206
0207
0208
0209
0210
0211
0212
0213
0214
0215
0216
0217
0218
0219
0220
0221
0222
0223
0224
0225
0226
0227
0228
0229
0230
0231
0232
0233
0234
0235
0236
0237
0238
0239
0240
0241
0242
0243
0244
0245
0246
0247
0248
0249
0250
0251
0252
0253
0254
0255
0256
0257
0258
0259
0260
0261
0262
0263
0264
0265
0266
0267
0268