WARNING: C Toolkit is deprecated, please use C++ Toolkit instead


SEQUIN APPLICATION

Sequin is a program designed to aid in the submission of sequences to the
GenBank, EMBL, and DDBJ sequence databases. It was written at the National
Center for Biotechnology Information, part of the National Library of
Medicine at the National Institutes of Health.

Sequin can assemble the essential elements of a GenBank record from simple
FASTA-format text files. For example, the program obtains the proper genetic
code from an organism name, and automatically determines coding region
intervals by back-translation from the protein sequence. An on-line help
window scrolls to the appropriate place as the user moves between and within
data entry forms, giving relevant details on what information is expected.

Sequin also contains a number of built-in validation functions for quality
assurance. Features such as splice sites and coding region translations are
checked for accuracy or internal consistency. Double-clicking on an error
message launches an appropriate editor by which the user can correct any
problems.

Sequin provides live, clickable views of the data in a variety of formats,
including a report form, GenBank flatfile, EMBL flatfile, and a graphical
view. Double-clicking on an item in any of these formats launches an editor
for that item. The editor is capable of maintaining correct feature table
positions as the underlying sequence is edited. It can display features on
the sequence during editing, and allows feature intervals to be adjusted by
direct manipulation.

Sequin runs on Macintosh, PC/Windows, UNIX and VMS computers and is available
by anonymous ftp from ftp.ncbi.nih.gov.


ADDING ANALYSIS PROGRAMS TO SEQUIN

The seqncfig file can be configured to add items to Sequin's Analysis menu.
Each section of the file describes how to construct a URL query to send to a
CGI program on the Internet. The CGI program is then responsible for running
the desired analysis program and for any necessary format conversions. A
sample CGI program written in C, testcgi.c, is in the demo directory of the
NCBI software toolkit. A Perl version will be available soon. (Some analysis
programs could be modified to act as CGI programs directly, and for those
cases a separate CGI mediator program would not be necessary.)
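
As a rough illustration of the mediator's first step, the Python sketch below
maps the "request" parameter of a query string to an analysis command. It is
not a port of testcgi.c: the wrapper program names are hypothetical
placeholders, and how Sequin delivers the sequence data to the CGI program is
not covered here.

```python
from urllib.parse import parse_qs

# Hypothetical dispatch table: "request" value -> command to run.
# The wrapper names are placeholders, not taken from testcgi.c.
ANALYSES = {
    "trnascan": ["trnascan_wrapper"],
    "seg": ["seg_wrapper"],
}


def choose_command(query_string):
    """Split a query such as 'request=seg&window=10' into a command
    and the remaining parameters (all kept as strings)."""
    params = {key: values[0] for key, values in parse_qs(query_string).items()}
    request = params.pop("request", None)
    if request not in ANALYSES:
        raise ValueError("unknown request: %r" % (request,))
    return list(ANALYSES[request]), params
```

A real mediator would then run the chosen program on the submitted sequence
and convert its output into whatever format Sequin expects.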

A portion of a sample seqncfig file is shown below. Note that the section
names for the services are listed in the [ORDER] section, and its items must
be of the form ORDER_1, ORDER_2, ..., ORDER_n.

    [ORDER]
    ORDER_1=tRNAscan
    ORDER_2=Seg
    ...

    [tRNAscan]
    HOST=www.myserver.myschool.edu
    PORT=80
    PATH=/MyServices/cgi-bin/testcgi.cgi
    QUERY=request=trnascan
    TITLE=tRNAscan-SE
    SUBMENU=Search
    FORMAT=FASTA
    FLAGS=SEQ,NUC,TRG,NET
    TIMEOUT=30

    [Seg]
    HOST=www.myserver.myschool.edu
    PORT=80
    PATH=/MyServices/cgi-bin/testcgi.cgi
    QUERY_1=request=seg
    TITLE_1=Seg default
    QUERY_2=request=seg&window=10&lowcut=1.0&hicut=1.5
    TITLE_2=Seg 10-1.0-1.5
    QUERY_3=request=seg&window=12&lowcut=2.3&hicut=2.6
    TITLE_3=Seg 12-2.3-2.6
    SUBMENU=Seg
    FORMAT=FASTA
    FLAGS=SEQ,DOC,PRT,TRG,NET
    TIMEOUT=30
    ...

    [ENZYMES]
    ENZ_1=BamHI
    ENZ_2=EcoRI
    ENZ_3=HindIII
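
To make the layout above concrete, the following Python sketch reads a
seqncfig-style text and lists the menu titles with the URLs they would
produce. It is only an illustration: the "http" scheme and the way HOST, PORT,
PATH, and QUERY are joined are assumptions, and menu_items is a hypothetical
helper, not part of Sequin.

```python
import configparser

SAMPLE = """\
[ORDER]
ORDER_1=tRNAscan
ORDER_2=Seg

[tRNAscan]
HOST=www.myserver.myschool.edu
PORT=80
PATH=/MyServices/cgi-bin/testcgi.cgi
QUERY=request=trnascan
TITLE=tRNAscan-SE

[Seg]
HOST=www.myserver.myschool.edu
PORT=80
PATH=/MyServices/cgi-bin/testcgi.cgi
QUERY_1=request=seg
TITLE_1=Seg default
QUERY_2=request=seg&window=10&lowcut=1.0&hicut=1.5
TITLE_2=Seg 10-1.0-1.5
"""


def menu_items(text):
    """Return (title, url) pairs in the order given by [ORDER]."""
    parser = configparser.ConfigParser(interpolation=None, delimiters=("=",))
    parser.optionxform = str  # keep key names such as QUERY_1 in upper case
    parser.read_string(text)
    items = []
    n = 1
    while parser.has_option("ORDER", "ORDER_%d" % n):
        section = parser[parser.get("ORDER", "ORDER_%d" % n)]
        # Assumed URL layout: http://HOST:PORT/PATH?QUERY
        base = "http://%s:%s%s" % (section["HOST"], section["PORT"], section["PATH"])
        if "QUERY" in section:
            pairs = [(section["TITLE"], section["QUERY"])]
        else:
            # Numbered variants: QUERY_1/TITLE_1, QUERY_2/TITLE_2, ...
            pairs = []
            i = 1
            while "QUERY_%d" % i in section:
                pairs.append((section["TITLE_%d" % i], section["QUERY_%d" % i]))
                i += 1
        items.extend((title, base + "?" + query) for title, query in pairs)
        n += 1
    return items


for title, url in menu_items(SAMPLE):
    print(title, "->", url)
```

Note that the "..." placeholder lines of the sample would have to be removed
before a strict INI parser such as this one accepts the file.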

The naming convention for configuration files varies by computer platform:
seqncfig.cnf on the Mac, seqncfig.ini on Windows, .seqncfigrc on UNIX, and
sequincfig.cfg on VMS. On the Mac, configuration files go in the
System Folder:Preferences folder, while on the PC they go in the "windows"
directory.

Individual configuration files may now be placed in a "services" directory
that is in the same directory as the Sequin program. They do not need to
conform to the naming convention described above.
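
The two lookup rules above can be summarized in a short Python sketch. The
file names are copied from the text; the platform keys and the service_files
helper are hypothetical, and the alphabetical ordering is an assumption (the
order in which Sequin reads the directory is not stated here).

```python
import os

# Per-platform configuration file names, as listed in the text.
CONFIG_NAMES = {
    "mac": "seqncfig.cnf",
    "windows": "seqncfig.ini",
    "unix": ".seqncfigrc",
    "vms": "sequincfig.cfg",
}


def service_files(program_dir):
    """List the regular files in the "services" directory that sits
    next to the Sequin program; any file name is accepted."""
    services = os.path.join(program_dir, "services")
    if not os.path.isdir(services):
        return []
    paths = (os.path.join(services, name) for name in os.listdir(services))
    return sorted(path for path in paths if os.path.isfile(path))
```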


SOLARIS EXECUTABLES

The Solaris executables distributed by NCBI no longer include a statically-
linked Motif library. This is due to the manner in which Sun distributes
Motif libraries. Users of Solaris version 2.4 and higher will find the
runtime shared Motif library on their system. Users of Solaris version 2.3
and earlier may need to either upgrade their systems or purchase the Motif
library separately.

It may be necessary to set your LD_LIBRARY_PATH environment variable to point
to the directory where the shared library, libXm.so.3 (the final digit may
vary), appears on your system. For example, in csh:

   setenv LD_LIBRARY_PATH /usr/dt/lib