BLASTX+BEAUTY Search Results

WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.

BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.

BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract

Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract



RepeatMasker repeats found in sequence:

No Repeats Found.

Reference:  Gish, Warren (1994-1997).  unpublished.
Gish, Warren and David J. States (1993).  Identification of protein coding
regions by database similarity search.  Nat. Genet. 3:266-72.

Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.

Query= 'D03E02.seq' (302 letters)

  Translating both strands of query sequence in all 6 reading frames

Database: nr 625,274 sequences; 197,782,623 total letters.



     Observed Numbers of Database Sequences Satisfying
    Various EXPECTation Thresholds (E parameter values)

        Histogram units:      = 43 Sequences     : less than 43 sequences

 EXPECTation Threshold
 (E parameter)
    |
    V   Observed Counts-->
  10000 10443 2281 |=====================================================
   6310  8162 1524 |===================================
   3980  6638 2567 |===========================================================
   2510  4071 1931 |============================================
   1580  2140  570 |=============
   1000  1570  430 |==========
    631  1140  277 |======
    398   863  173 |====
    251   690  161 |===
    158   529  118 |==
    100   411  114 |==
   63.1   297   58 |=
   39.8   239   36 |:
   25.1   203   31 |:
   15.8   172   24 |:
 >>>>>>>>>>>>>>>>>>>>>  Expect = 10.0, Observed = 148  <<<<<<<<<<<<<<<<<
   10.0   148   28 |:
   6.31   120   24 |:
   3.98    96   31 |:
   2.51    65   17 |:
   1.58    48    8 |:
   1.00    40    9 |:
   0.63    31    8 |:
   0.40    23    5 |:
   0.25    18    7 |:
   0.16    11    4 |:
   0.10     7    1 |:
  0.063     6    1 |:
  0.040     5    0 |
  0.025     5    0 |
  0.016     5    1 |:
  0.010     4    1 |:
 0.0063     3    0 |
 0.0040     3    0 |
 0.0025     3    2 |:


                                                                     Smallest
                                                                       Sum
                                                     Reading  High  Probability
Sequences producing High-scoring Segment Pairs:        Frame Score  P(N)      N
gi|461707|sp|P34687|CC34_CAEELCUTICLE COLLAGEN 34 >gi... +1    71  0.00086   2
gi|115399|sp|P16252|CAC2_HAECOCUTICLE COLLAGEN 2C >gi... +1    70  0.0024    2
gi|321007|pir||B44984collagen - nematode (Haemonchus ... +1    70  0.0024    2
gi|7500590|pir||T29956hypothetical protein F36A4.10 -... +1    71  0.0090    2
gi|7331814|gb|AAF60502.1|(AC006743) contains similari... +1    89  0.014     1
gi|9625710ref|NP_039959.1| UL25 FAMILY [human herpesv... -1    71  0.039     2
gi|2978419|gb|AAC06113.1|(M63709) alpha-1 type II col... +1    75  0.080     1
gi|3242649|dbj|BAA29028.1|(AB015440) alpha 1 type I c... -1    79  0.099     2
gi|115267|sp||CA11_BOVIN_2[Segment 2 of 2] COLLAGEN A... -1    71  0.11      2
gi|4836662|gb|AAD30510.1|AF129925_4(AF129925) CsoS2 [... -1    72  0.13      2
gi|227093|prf||1614239Acollagen alpha1(V) 66-267 [Hom... +1    77  0.14      1
gi|263810|gb|AAB24972.1|collagen alpha chain [Riftia ... -1    70  0.15      2
gi|399170|sp|P30754|CAFF_RIFPAFIBRIL-FORMING COLLAGEN... -1    70  0.15      2
gi|2119156|pir||S28774collagen alpha chain - tube wor... -1    70  0.15      2
gi|7499772|pir||T21314hypothetical protein F23H12.4 -... +1    79  0.17      1
gi|476420|pir||CGBO1Scollagen alpha 1(I) chain - bovi... -1    71  0.17      2
gi|7441988|pir||T16984transcription factor homolog BT... +3    74  0.19      1
gi|7516580|pir||C72637hypothetical protein APE1554 - ... -2    71  0.20      1
gi|71405|pir||CGCH1Scollagen alpha 1(I) chain - chick... -1    70  0.24      2
gi|115397|sp|P08124|CC01_CAEELCUTICLE COLLAGEN 1 >gi|... +1    77  0.25      1
gi|180882|gb|AAA52053.1|(M21353) alpha-2 type I colla... +3    42  0.27      2
gi|5732934|gb|AAD49346.1|(AF169346) pro-alpha-1 type ... +1    75  0.27      1
gi|115411|sp|P17657|CCDC_CAEELCUTICLE COLLAGEN DPY-13... +1    67  0.30      2
gi|7504118|pir||T22607hypothetical protein F54B11.1 -... +1    65  0.34      2
gi|83714|pir||JN0254qutH protein - Emericella nidulans   +3    77  0.34      1
gi|192264|gb|AAA37334.1|(M17491) procollagen type I a... +1    77  0.36      1
gi|3288493|emb|CAA75878.1|(Y15918) COL1A1 and PDGFB f... -1    71  0.40      1
gi|10440430|dbj|BAB15748.1|(AK024458) FLJ00050 protei... -2    74  0.42      1
gi|115268|sp|P02457|CA11_CHICKCOLLAGEN ALPHA 1(I) CHA... -1    70  0.42      2
gi|2136732|pir||I45876collagen alpha 1(II) chain - bo... +1    67  0.45      1
gi|159960|gb|AAA29439.1|(M24558) collagen-like protei... +1    74  0.45      1
gi|7510834|pir||T28999hypothetical protein ZC513.8 - ... +1    74  0.48      1
gi|930045|emb|CAA33387.1|(X15332) alpha-1 (III) colla... +1    68  0.50      2
gi|553198|gb|AAA51816.1|(M31731) bcl-3 protein [Homo ... +1    66  0.53      1
gi|5921192|sp|P02467|CA21_CHICKCOLLAGEN ALPHA 2(I) CH... +1    69  0.53      2
gi|192262|gb|AAA37333.1|(M14423) pro-alpha-1 type I c... +1    77  0.54      1
gi|8922952ref|NP_060836.1| hypothetical protein FLJ11... -1    71  0.55      1
gi|2143752|pir||I60384gene T1 protein - rat (fragment... +2    41  0.57      2
gi|2119160|pir||I50629collagen - chicken (fragment) >... +1    75  0.61      1
gi|7508879|pir||T28770hypothetical protein W03D2.1 - ... +1    68  0.63      2
gi|11437170ref|XP_003597.1| hypothetical protein FLJ1... -1    71  0.63      1
gi|1340174|emb|CAA25821.1|(X01655) type III procollag... +1    68  0.66      1
gi|71415|pir||CGRT2Scollagen alpha 2(I) chain - rat (... +3    54  0.70      2
gi|7494559|pir||T28887collagen dpy-10 - Caenorhabditi... +1    71  0.71      1
gi|4502951ref|NP_000081.1| collagen, type III, alpha ... +1    68  0.73      2
gi|1070603|pir||CGHU7Lcollagen alpha 1(III) chain pre... +1    68  0.73      2
gi|3171998|emb|CAA06510.1|(AJ005395) collagen alpha 1... +1    74  0.77      1
gi|1814029|gb|AAB41793.1|(U84501) cuticle collagen [C... +1    66  0.79      2
gi|258774|gb|AAB23914.1|type II collagen alpha 1 chai... +1    71  0.81      1
gi|2388676|gb|AAB80719.1|(AF015539) precollagen P [My... +1    64  0.81      2

Use the and icons to retrieve links to Entrez:

E = Retrieve Entrez links (e.g., Medline abstracts, FASTA-formatted sequence reports).
R = Retrieve links to Related sequences (neighbors).
Use the icon (if present) to retrieve links to the Sequence Retrieval System (SRS).
Use the icon (if present) to retrieve links to the Ligand Enzyme and Chemical Compound Database .
Use the icon (if present) to retrieve links to the Protein Data Bank database.

WARNING:  Descriptions of 98 database sequences were not reported due to the
          limiting value of parameter V = 50.



to_Entrezto_Relatedto_Related >gi|461707|sp|P34687|CC34_CAEEL  CUTICLE COLLAGEN 34 >gi|345339|pir||JC1448
            collagen col-34 - Caenorhabditis elegans >gi|156250|gb|AAA27985.1|
            (M80650) alpha-collagen [Caenorhabditis elegans]
            Length = 298

Frame  2 hits (HSPs):                   ___________                       
Frame  1 hits (HSPs):                                         __________  
Annotated Domains:      __________________________________________________
                        __________________________________________________
Database sequence:     |        |       |        |       |       |        | 298
                       0       50     100      150     200     250
__________________

Annotated Domains:
   DOMO                 DM00019: FIBRILLARCOLLAGENCARBOXYL-TERMI 106..271
   Entrez               Domain: TRIPLE-HELICAL REGION.           103..132
   Entrez               Domain: TRIPLE-HELICAL REGION.           151..177
   Entrez               Domain: TRIPLE-HELICAL REGION.           181..198
   Entrez               Domain: TRIPLE-HELICAL REGION.           215..277
   PFAM                 Collagen: Collagen triple helix repeat ( 143..201
   PFAM                 Collagen: Collagen triple helix repeat ( 215..274
   PRODOM               PD004226:                                6..32
   PRODOM               PD000926:                                34..100
   PRODOM               PD000540: H1(18) O76786(11) TONB(10)     102..197
   PRODOM               PD026369: CC34(1) Q20087(1)              238..258
   PRODOM               PD002391:                                278..297
__________________


  Plus Strand HSPs:

 Score = 71 (25.0 bits), Expect = 0.00086, Sum P(2) = 0.00086
 Identities = 19/52 (36%), Positives = 23/52 (44%), Frame = +1

Query:   139 GAPARVRAPSCPDPVAPRGAVGESGCLGMQPQSGGKFRPRLNTGERPIANKY 294
             G P    +P  P P  P G  G+ G  G  P + G   P  + GER I  KY
Sbjct:   233 GQPGADGSPGQPGPKGPNGPDGQPGADG-NPGAPGPAGPPGSPGERGICPKY 283

 Score = 48 (16.9 bits), Expect = 0.00086, Sum P(2) = 0.00086
 Identities = 11/31 (35%), Positives = 14/31 (45%), Frame = +2

Query:    65 GRPRRPNCSLEXRPQWRTGPKSP-GRGRQPG 154
             GRP +  C     P W+  P+ P G    PG
Sbjct:   130 GRPPQQPCEPITPPPWKPCPQGPPGPPGPPG 160

 Score = 38 (13.4 bits), Expect = 0.0090, Sum P(2) = 0.0090
 Identities = 9/24 (37%), Positives = 11/24 (45%), Frame = +2

Query:    83 NCSLEXRPQWRTGPKSPGRGRQPG 154
             +C L   P     P  PGR  +PG
Sbjct:    98 SCCLPGPPGPAGTPGKPGRPGKPG 121


to_Entrezto_Relatedto_Related >gi|115399|sp|P16252|CAC2_HAECO  CUTICLE COLLAGEN 2C >gi|159167|gb|AAA29172.1|
            (J04670) collagen 2c [Haemonchus contortus]
            Length = 210

Frame  2 hits (HSPs):                 ____________                        
Frame  1 hits (HSPs):                                     _____________   
Annotated Domains:                    _______________ _______________     
                        __________________________________________________
Database sequence:     |           |           |           |           |  | 210
                       0          50         100         150         200
__________________

Annotated Domains:
   PFAM                 Collagen: Collagen triple helix repeat ( 62..121
   PFAM                 Collagen: Collagen triple helix repeat ( 130..189
__________________


  Plus Strand HSPs:

 Score = 70 (24.6 bits), Expect = 0.0024, Sum P(2) = 0.0024
 Identities = 19/52 (36%), Positives = 21/52 (40%), Frame = +1

Query:   139 GAPARVRAPSCPDPVAPRGAVGESGCLGMQPQSGGKFRPRLNTGERPIANKY 294
             GAP    AP  P    PRG  G  G  G     G   +P    GER +  KY
Sbjct:   145 GAPGHPGAPGAPGEKGPRGQDGHPGAPGNAGHPGQPGQPG-PPGERGVCPKY 195

 Score = 40 (14.1 bits), Expect = 0.0024, Sum P(2) = 0.0024
 Identities = 12/39 (30%), Positives = 14/39 (35%), Frame = +2

Query:    50 PTMKIGRPRRPNCSLEXRPQWRTGPKSP----GRGRQPG 154
             P  + G P  P       P    GPK P    G+   PG
Sbjct:    72 PPGEPGTPGNPGAPGNDAPPGPPGPKGPPGPPGKAGAPG 110

 Score = 39 (13.7 bits), Expect = 0.0031, Sum P(2) = 0.0031
 Identities = 9/18 (50%), Positives = 10/18 (55%), Frame = +2

Query:   104 PQWRTGPKSP-GRGRQPG 154
             PQ R GP  P G   +PG
Sbjct:    60 PQGRPGPPGPIGPPGEPG 77


to_Entrezto_Relatedto_Related >gi|321007|pir||B44984  collagen - nematode (Haemonchus contortus) (fragment)
            Length = 210

Frame  2 hits (HSPs):                 ____________                        
Frame  1 hits (HSPs):                                     _____________   
                        __________________________________________________
Database sequence:     |           |           |           |           |  | 210
                       0          50         100         150         200

  Plus Strand HSPs:

 Score = 70 (24.6 bits), Expect = 0.0024, Sum P(2) = 0.0024
 Identities = 19/52 (36%), Positives = 21/52 (40%), Frame = +1

Query:   139 GAPARVRAPSCPDPVAPRGAVGESGCLGMQPQSGGKFRPRLNTGERPIANKY 294
             GAP    AP  P    PRG  G  G  G     G   +P    GER +  KY
Sbjct:   145 GAPGHPGAPGAPGEKGPRGQDGHPGAPGNAGHPGQPGQPG-PPGERGVCPKY 195

 Score = 40 (14.1 bits), Expect = 0.0024, Sum P(2) = 0.0024
 Identities = 12/39 (30%), Positives = 14/39 (35%), Frame = +2

Query:    50 PTMKIGRPRRPNCSLEXRPQWRTGPKSP----GRGRQPG 154
             P  + G P  P       P    GPK P    G+   PG
Sbjct:    72 PPGEPGTPGNPGAPGNDAPPGPPGPKGPPGPPGKAGAPG 110

 Score = 39 (13.7 bits), Expect = 0.0031, Sum P(2) = 0.0031
 Identities = 9/18 (50%), Positives = 10/18 (55%), Frame = +2

Query:   104 PQWRTGPKSP-GRGRQPG 154
             PQ R GP  P G   +PG
Sbjct:    60 PQGRPGPPGPIGPPGEPG 77


to_Entrezto_Relatedto_Related >gi|7500590|pir||T29956  hypothetical protein F36A4.10 - Caenorhabditis elegans
            >gi|1255802|gb|AAA96155.1| (U53333) coded for by C. elegans cDNA
            yk120g12.5; Similar to cuticular collagen [Caenorhabditis elegans]
            Length = 299

Frame  2 hits (HSPs):                   ___________                       
Frame  1 hits (HSPs):                                         __________  
                        __________________________________________________
Database sequence:     |        |       |       |        |       |        | 299
                       0       50     100     150      200     250

  Plus Strand HSPs:

 Score = 71 (25.0 bits), Expect = 0.0091, Sum P(2) = 0.0090
 Identities = 19/52 (36%), Positives = 23/52 (44%), Frame = +1

Query:   139 GAPARVRAPSCPDPVAPRGAVGESGCLGMQPQSGGKFRPRLNTGERPIANKY 294
             G P    +P  P P  P G  G+ G  G  P + G   P  + GER I  KY
Sbjct:   234 GQPGADGSPGQPGPKGPNGPDGQPGADG-NPGAPGPAGPPGSPGERGICPKY 284

 Score = 38 (13.4 bits), Expect = 0.0091, Sum P(2) = 0.0090
 Identities = 9/24 (37%), Positives = 11/24 (45%), Frame = +2

Query:    83 NCSLEXRPQWRTGPKSPGRGRQPG 154
             +C L   P     P  PGR  +PG
Sbjct:    98 SCCLPGPPGPAGTPGKPGRPGKPG 121

 Score = 35 (12.3 bits), Expect = 0.018, Sum P(2) = 0.018
 Identities = 10/31 (32%), Positives = 13/31 (41%), Frame = +2

Query:    65 GRPRRPNCSLEXRPQWRTGPKSP-GRGRQPG 154
             GRP +  C     P  +  P+ P G    PG
Sbjct:   130 GRPPQQPCEPITPPPCKPCPQGPPGPPGPPG 160


to_Entrezto_Relatedto_Related >gi|7331814|gb|AAF60502.1|  (AC006743) contains similarity to Pfam family
            PF01391 (Collagen triple helix repeats), score=66, E=8.2e-16, N=2
            [Caenorhabditis elegans]
            Length = 291

Frame  2 hits (HSPs):                  _______           ______           
Frame  1 hits (HSPs):                              _________  __________  
                        __________________________________________________
Database sequence:     |        |        |       |        |       |       | 291
                       0       50      100     150      200     250

  Plus Strand HSPs:

 Score = 89 (31.3 bits), Expect = 0.014, P = 0.014
 Identities = 19/47 (40%), Positives = 23/47 (48%), Frame = +1

Query:   139 GAPARVRAPSCPDPVAPRGAVGESGCLGMQPQSGGKFRPRLNTGERP 279
             G P R  +P  P PV P GA G+SG  G   + G   RP  +T   P
Sbjct:   161 GNPGRPGSPGTPGPVGPNGASGDSGAPGNDGEKGEPGRPAQSTPSTP 207

 Score = 65 (22.9 bits), Expect = 0.59, Sum P(2) = 0.45
 Identities = 23/52 (44%), Positives = 26/52 (50%), Frame = +1

Query:   139 GAPARVRAPSC---PDPVAPRGAVGESGCLGMQPQSGGKFRPRLNTGERPIANKY 294
             GAP R   P     P P  P G VGE+G  G +P   G   P+   GER I  KY
Sbjct:   226 GAPGRDGQPGQSGPPGPPGPPGNVGEAGPPG-KPGQPGLPGPQ---GERGICPKY 276

 Score = 42 (14.8 bits), Expect = 0.59, Sum P(2) = 0.45
 Identities = 15/35 (42%), Positives = 17/35 (48%), Frame = +2

Query:    59 KIGRPRRPNCSLEXRPQWRTG-PKSPGRGRQPG*EP 163
             K G P RP   L  R   + G P +PGR   PG  P
Sbjct:    94 KPGHPGRPG--LPGR-NGKPGVPGAPGRPGTPGRPP 126

 Score = 34 (12.0 bits), Expect = 3.7, Sum P(2) = 0.98
 Identities = 10/30 (33%), Positives = 10/30 (33%), Frame = +2

Query:    65 GRPRRPNCSLEXRPQWRTGPKSPGRGRQPG 154
             G P RP  S    P     P   G    PG
Sbjct:   194 GEPGRPAQSTPSTPGEPGNPGDAGATGAPG 223


to_Entrezto_Related >gi|9625710  ref|NP_039959.1| UL25 FAMILY [human herpesvirus 5]
            >gi|136862|sp|P16761|UL25_HCMVA HYPOTHETICAL PROTEIN UL25
            >gi|73666|pir||QQBET2 UL25 protein - human cytomegalovirus (strain
            AD169) >gi|59630|emb|CAA35424.1| (X17403) UL25 FAMILY [human
            herpesvirus 5]
            Length = 656

Frame -1 hits (HSPs):            _____                                    
Frame -2 hits (HSPs):                                           __        
                        __________________________________________________
Database sequence:     |           |          |           |          |    | 656
                       0         150        300         450        600

  Minus Strand HSPs:

 Score = 71 (25.0 bits), Expect = 0.040, Sum P(2) = 0.039
 Identities = 18/58 (31%), Positives = 27/58 (46%), Frame = -1

Query:   224 IPKQPDSPTAPRGATGSGHDGALTLAGAPFQGTWARS-ATEDASPDYNSDAEGDRFSW 54
             + + P  P+ P    G G D   +  G+  + T   S +T   +P   S AEGD FS+
Sbjct:   120 VSRPPSVPSLPENGAGGGGDDNSSSGGSSSRTTSNSSRSTSPVAPGEPSAAEGDEFSF 177

 Score = 40 (14.1 bits), Expect = 0.040, Sum P(2) = 0.039
 Identities = 8/12 (66%), Positives = 9/12 (75%), Frame = -2

Query:    52 GXFPVRSPLLRE 17
             G F VR PLLR+
Sbjct:   534 GEFMVRDPLLRD 545


to_Entrezto_Relatedto_Related >gi|2978419|gb|AAC06113.1|  (M63709) alpha-1 type II collagen [Mus musculus]
            Length = 115

Frame  1 hits (HSPs):   __________________________________                
                        __________________________________________________
Database sequence:     |                     |                     |      | 115
                       0                    50                   100

  Plus Strand HSPs:

 Score = 75 (26.4 bits), Expect = 0.084, P = 0.080
 Identities = 26/85 (30%), Positives = 29/85 (34%), Frame = +1

Query:    31 AXEPGKXHHENRSPSASEL*SGXASSVADRAQVPWKGAPARVRAPSCPDPVAPRGAVGES 210
             A EPG+               G A    DR +    GAP     P  P P  P G  G+ 
Sbjct:     1 AGEPGREGSPGADGPPGR--DGAAGVKGDRGETGALGAPGAPGPPGSPGPAGPTGKQGDR 58

Query:   211 GCLGMQPQSGGKFRPRLNTGERPIA 285
             G  G Q    G   P    G R IA
Sbjct:    59 GEAGAQ----GPMGPSGPAGARGIA 79


to_Entrezto_Relatedto_Related >gi|3242649|dbj|BAA29028.1|  (AB015440) alpha 1 type I collagen [Rana
            catesbeiana]
            Length = 1445

Frame -1 hits (HSPs):                         ___                         
Frame -3 hits (HSPs):                                          __         
                        __________________________________________________
Database sequence:     |                 |                |               | 1445
                       0               500             1000

  Minus Strand HSPs:

 Score = 79 (27.8 bits), Expect = 0.10, Sum P(2) = 0.099
 Identities = 24/49 (48%), Positives = 28/49 (57%), Frame = -1

Query:   281 IGLS-PVFSLG-RNLPPDWGCIPKQPDSPTAPRGATGS-GHDGALTLAGAP 138
             +G S P  S G R  P + G I   P  P  PRGA G+ G+DGA   AGAP
Sbjct:   652 VGPSGPAGSRGERGFPGERGAIG--PPGPQGPRGANGAPGNDGAKGEAGAP 700

 Score = 35 (12.3 bits), Expect = 0.10, Sum P(2) = 0.099
 Identities = 11/29 (37%), Positives = 13/29 (44%), Frame = -3

Query:   150 GWRPLPGDLGPVRH*GRXSRLQFGRRGRP 64
             G   LPG +GP    GR   +  G  G P
Sbjct:  1139 GSNGLPGPIGPPGPRGRTGDV--GPAGPP 1165


to_Entrezto_Relatedto_Related >gi|115267|sp||CA11_BOVIN_2  [Segment 2 of 2] COLLAGEN ALPHA 1(I) CHAIN
            Length = 634

Frame  1 hits (HSPs):                                    ______           
Frame -1 hits (HSPs):          ____                                       
Frame -3 hits (HSPs):                                               ____  
                        __________________________________________________
Database sequence:     |           |           |           |           |  | 634
                       0         150         300         450         600

  Plus Strand HSPs:

 Score = 78 (27.5 bits), Expect = 0.64, P = 0.47
 Identities = 23/65 (35%), Positives = 28/65 (43%), Frame = +1

Query:    31 AXEPGKXHHENRSPSA--SEL*SGXASSVADRAQVPWKGAPARVRAPSCPDPVAPRGAVG 204
             A  PG+   E  +P A  S    G   +  DR +    GAP    AP  P PV P G  G
Sbjct:   423 AGPPGESGREG-APGAEGSPGRDGSPGAKGDRGETGPAGAPGPPGAPGAPGPVGPAGKSG 481

Query:   205 ESGCLG 222
             + G  G
Sbjct:   482 DRGETG 487

  Minus Strand HSPs:

 Score = 71 (25.0 bits), Expect = 0.12, Sum P(2) = 0.11
 Identities = 18/38 (47%), Positives = 22/38 (57%), Frame = -1

Query:   251 RNLPPDWGCIPKQPDSPTAPRGATGS-GHDGALTLAGAP 138
             R  P + G   + P  P  PRGA G+ G+DGA   AGAP
Sbjct:    99 RGFPGERGV--EGPPGPAGPRGANGAPGNDGAKGDAGAP 135

 Score = 35 (12.3 bits), Expect = 0.12, Sum P(2) = 0.11
 Identities = 11/29 (37%), Positives = 12/29 (41%), Frame = -3

Query:   150 GWRPLPGDLGPVRH*GRXSRLQFGRRGRP 64
             G   LPG +GP    GR      G  G P
Sbjct:   571 GLNGLPGPIGPPGPRGRTG--DAGPAGPP 597


to_Entrezto_Relatedto_Related >gi|4836662|gb|AAD30510.1|AF129925_4  (AF129925) CsoS2 [Acidithiobacillus
            ferrooxidans]
            Length = 766

Frame -1 hits (HSPs):                                              _____  
Frame -2 hits (HSPs):                        __                           
                        __________________________________________________
Database sequence:     |         |         |         |         |        | | 766
                       0       150       300       450       600      750

  Minus Strand HSPs:

 Score = 72 (25.3 bits), Expect = 0.14, Sum P(2) = 0.13
 Identities = 17/57 (29%), Positives = 27/57 (47%), Frame = -1

Query:   233 WGCIPKQPDSPTAPRGA-TGSGHDGALTLAGAPFQGTWARSATEDASPDYNSDAEGDR 63
             +G +P    +   PR   TG GH+G   + GA ++   + + TE  S   N    GD+
Sbjct:   666 YGAVPTTA-ATEVPRSRLTGDGHEGGFAITGAAWRRNESITGTEGTSTRRNQTLRGDQ 722

 Score = 35 (12.3 bits), Expect = 0.14, Sum P(2) = 0.13
 Identities = 6/15 (40%), Positives = 8/15 (53%), Frame = -2

Query:   295 GTCSLSVSRQYLALD 251
             GTC      +YL+ D
Sbjct:   325 GTCKAVTGTEYLSAD 339


to_Entrezto_Related >gi|227093|prf||1614239A  collagen alpha1(V) 66-267 [Homo sapiens]
            Length = 202

Frame  1 hits (HSPs):       __________________                            
                        __________________________________________________
Database sequence:     |            |           |           |            || 202
                       0           50         100         150          200

  Plus Strand HSPs:

 Score = 77 (27.1 bits), Expect = 0.15, P = 0.14
 Identities = 22/68 (32%), Positives = 25/68 (36%), Frame = +1

Query:    43 GKXHHENRSPSASEL*SGXASSVADRAQVPWKGAPARVRAPSCPDPVAPRGAVGESGCLG 222
             GK H  +   S      G      D  QV  +G P +      P P  P G  GE G  G
Sbjct:    21 GKGHRGDPGLSGPPGPPGDDGEEGDDGQVGPRGLPGQPGPRGLPGPKGPPGVTGEPGAPG 80

Query:   223 MQPQSGGK 246
             M  Q G K
Sbjct:    81 MDGQPGPK 88


to_Entrezto_Related >gi|263810|gb|AAB24972.1|  collagen alpha chain [Riftia pachyptila=tube worms,
            Peptide, 1027 aa]
            Length = 1027

Frame -1 hits (HSPs):        ___                                          
Frame -3 hits (HSPs):                __                           __      
                        __________________________________________________
Database sequence:     |       |      |      |       |      |      |      | 1027
                       0     150    300    450     600    750    900

  Minus Strand HSPs:

 Score = 70 (24.6 bits), Expect = 0.17, Sum P(2) = 0.15
 Identities = 17/36 (47%), Positives = 17/36 (47%), Frame = -1

Query:   248 NLPPDWGCIPKQPDSPTAPRGATGS-GHDGALTLAG 144
             N  PD G  P  P  P  PRG TG  G DG   L G
Sbjct:   116 NQGPDGGPGPAGPSGPIGPRGQTGERGRDGKSGLPG 151

 Score = 39 (13.7 bits), Expect = 0.17, Sum P(2) = 0.15
 Identities = 10/24 (41%), Positives = 12/24 (50%), Frame = -3

Query:   135 PGDLGPVRH*GRXSRLQFGRRGRP 64
             PGD+G   H G       G+RG P
Sbjct:   285 PGDVGAPGHAGEA-----GKRGSP 303

 Score = 34 (12.0 bits), Expect = 0.54, Sum P(2) = 0.42
 Identities = 7/11 (63%), Positives = 7/11 (63%), Frame = -3

Query:   150 GWRPLPGDLGP 118
             G R LPG  GP
Sbjct:   883 GQRGLPGAAGP 893


to_Entrezto_Relatedto_Related >gi|399170|sp|P30754|CAFF_RIFPA  FIBRIL-FORMING COLLAGEN ALPHA CHAIN
            Length = 1027

Frame -1 hits (HSPs):        ___                                          
Frame -3 hits (HSPs):                __                           __      
Annotated Domains:      __________________________________________________
                        __________________________________________________
Database sequence:     |       |      |      |       |      |      |      | 1027
                       0     150    300    450     600    750    900
__________________

Annotated Domains:
   DOMO                 DM00042: FIBRILLARCOLLAGENCARBOXYL-TERMI 17..151
   DOMO                 DM00019: FIBRILLARCOLLAGENCARBOXYL-TERMI 153..331
   DOMO                 DM00042: FIBRILLARCOLLAGENCARBOXYL-TERMI 333..424
   DOMO                 DM00019: FIBRILLARCOLLAGENCARBOXYL-TERMI 426..615
   DOMO                 DM00042: FIBRILLARCOLLAGENCARBOXYL-TERMI 617..711
   DOMO                 DM00019: FIBRILLARCOLLAGENCARBOXYL-TERMI 713..861
   Entrez               Domain: NONHELICAL REGION (N-TERMINAL).  1..12
   Entrez               Domain: TRIPLE-HELICAL REGION.           13..1023
   Entrez               Domain: NONHELICAL REGION (C-TERMINAL).  1024..1027
   Entrez               hydroxylation site: (PARTIAL).           21
   Entrez               hydroxylation site: (PARTIAL).           24
   Entrez               hydroxylation site                       27
   Entrez               hydroxylation site                       39
   Entrez               hydroxylation site: (PARTIAL).           53
   Entrez               hydroxylation site                       54
   Entrez               hydroxylation site: (PARTIAL).           72
   Entrez               hydroxylation site                       90
   Entrez               hydroxylation site                       93
   Entrez               hydroxylation site: (PARTIAL).           123
   Entrez               hydroxylation site: (PARTIAL).           128
   Entrez               hydroxylation site                       150
   Entrez               hydroxylation site: (PARTIAL).           161
   Entrez               hydroxylation site                       162
   Entrez               hydroxylation site: (PARTIAL).           164
   Entrez               hydroxylation site                       165
   Entrez               hydroxylation site                       174
   Entrez               hydroxylation site                       177
   Entrez               hydroxylation site                       180
   Entrez               hydroxylation site                       183
   Entrez               hydroxylation site                       207
   Entrez               hydroxylation site                       216
   Entrez               hydroxylation site                       219
   Entrez               hydroxylation site                       228
   Entrez               hydroxylation site                       237
   Entrez               hydroxylation site: (PARTIAL).           243
   Entrez               hydroxylation site                       249
   Entrez               hydroxylation site                       255
   Entrez               hydroxylation site: (PARTIAL).           273
   Entrez               hydroxylation site: (PARTIAL).           276
   Entrez               hydroxylation site: (PARTIAL).           285
   Entrez               hydroxylation site: (PARTIAL).           291
   Entrez               hydroxylation site: (PARTIAL).           303
   Entrez               hydroxylation site                       306
   Entrez               hydroxylation site                       312
   Entrez               hydroxylation site                       321
   Entrez               hydroxylation site                       327
   Entrez               hydroxylation site                       339
   Entrez               hydroxylation site                       342
   Entrez               hydroxylation site: (PARTIAL).           348
   Entrez               hydroxylation site: (PARTIAL).           351
   Entrez               hydroxylation site                       366
   Entrez               hydroxylation site                       372
   Entrez               hydroxylation site                       375
   Entrez               hydroxylation site: (PARTIAL).           381
   Entrez               hydroxylation site                       387
   Entrez               hydroxylation site: (PARTIAL).           416
   Entrez               hydroxylation site                       417
   Entrez               hydroxylation site                       423
   Entrez               hydroxylation site                       429
   Entrez               hydroxylation site                       432
   Entrez               hydroxylation site                       453
   Entrez               hydroxylation site                       465
   Entrez               hydroxylation site                       483
   Entrez               hydroxylation site: (PARTIAL).           500
   Entrez               hydroxylation site: (PARTIAL).           503
   Entrez               hydroxylation site: (PARTIAL).           506
   Entrez               hydroxylation site                       513
   Entrez               hydroxylation site                       525
   Entrez               hydroxylation site: (PARTIAL).           533
   Entrez               hydroxylation site: (PARTIAL).           536
   Entrez               hydroxylation site                       540
   Entrez               hydroxylation site                       546
   Entrez               hydroxylation site: (PARTIAL).           551
   Entrez               hydroxylation site                       552
   Entrez               hydroxylation site                       561
   Entrez               hydroxylation site                       603
   Entrez               other site: IMPERFECTION IN THE GAA REPE 610
   Entrez               hydroxylation site: (PARTIAL).           621
   Entrez               hydroxylation site                       627
   Entrez               hydroxylation site: (PARTIAL).           645
   Entrez               hydroxylation site: (PARTIAL).           647
   Entrez               hydroxylation site                       648
   Entrez               hydroxylation site                       663
   Entrez               hydroxylation site                       708
   Entrez               hydroxylation site                       711
   Entrez               hydroxylation site                       714
   Entrez               hydroxylation site                       717
   Entrez               hydroxylation site                       723
   Entrez               hydroxylation site                       744
   Entrez               hydroxylation site                       759
   Entrez               hydroxylation site: (PARTIAL).           773
   Entrez               hydroxylation site                       774
   Entrez               hydroxylation site                       783
   Entrez               hydroxylation site                       792
   Entrez               hydroxylation site: (PARTIAL).           815
   Entrez               hydroxylation site                       816
   Entrez               hydroxylation site                       843
   Entrez               hydroxylation site                       849
   Entrez               hydroxylation site                       855
   Entrez               hydroxylation site                       861
   Entrez               hydroxylation site                       867
   Entrez               hydroxylation site                       888
   Entrez               hydroxylation site                       894
   Entrez               hydroxylation site                       903
   Entrez               hydroxylation site                       915
   Entrez               hydroxylation site: (PARTIAL).           933
   Entrez               hydroxylation site                       939
   Entrez               hydroxylation site                       945
   Entrez               hydroxylation site: (PARTIAL).           954
   Entrez               hydroxylation site                       963
   Entrez               hydroxylation site                       966
   Entrez               hydroxylation site                       984
   Entrez               hydroxylation site                       990
   Entrez               hydroxylation site: (PARTIAL).           1010
   Entrez               hydroxylation site                       1011
   Entrez               hydroxylation site: (PARTIAL).           1013
   Entrez               hydroxylation site                       1014
   Entrez               hydroxylation site: (PARTIAL).           1016
   Entrez               hydroxylation site                       1017
   Entrez               hydroxylation site: (PARTIAL).           1019
   Entrez               hydroxylation site                       1020
   PFAM                 Collagen: Collagen triple helix repeat ( 19..78
   PFAM                 Collagen: Collagen triple helix repeat ( 79..138
   PFAM                 Collagen: Collagen triple helix repeat ( 142..201
   PFAM                 Collagen: Collagen triple helix repeat ( 205..264
   PFAM                 Collagen: Collagen triple helix repeat ( 265..324
   PFAM                 Collagen: Collagen triple helix repeat ( 325..384
   PFAM                 Collagen: Collagen triple helix repeat ( 397..456
   PFAM                 Collagen: Collagen triple helix repeat ( 457..516
   PFAM                 Collagen: Collagen triple helix repeat ( 520..579
   PFAM                 Collagen: Collagen triple helix repeat ( 583..642
   PFAM                 Collagen: Collagen triple helix repeat ( 643..702
   PFAM                 Collagen: Collagen triple helix repeat ( 706..765
   PFAM                 Collagen: Collagen triple helix repeat ( 766..825
   PFAM                 Collagen: Collagen triple helix repeat ( 838..897
   PFAM                 Collagen: Collagen triple helix repeat ( 898..957
   PFAM                 Collagen: Collagen triple helix repeat ( 958..1017
   PRODOM               PD193117: CAFF_RIFPA                     32..50
   PRODOM               PD000007: CA14(90) CA24(88) CA13(40)     52..82
   PRODOM               PD000007: CA14(90) CA24(88) CA13(40)     116..147
   PRODOM               PD000007: CA14(90) CA24(88) CA13(40)     150..180
   PRODOM               PD026538: CAFF_RIFPA                     182..217
   PRODOM               PD000007: CA14(90) CA24(88) CA13(40)     219..249
   PRODOM               PD055865: CAFF_RIFPA                     251..335
   PRODOM               PD000007: CA14(90) CA24(88) CA13(40)     337..366
   PRODOM               PD000007: CA14(90) CA24(88) CA13(40)     372..403
   PRODOM               PD000007: CA14(90) CA24(88) CA13(40)     409..438
   PRODOM               PD000007: CA14(90) CA24(88) CA13(40)     453..483
   PRODOM               PD159171: CAFF_RIFPA                     485..521
   PRODOM               PD000007: CA14(90) CA24(88) CA13(40)     523..553
   PRODOM               PD000277: CA14(15) Q26640(12) Q26639(11) 574..615
   PRODOM               PD000007: CA14(90) CA24(88) CA13(40)     620..652
   PRODOM               PD193705: CAFF_RIFPA                     654..677
   PRODOM               PD000277: CA14(15) Q26640(12) Q26639(11) 679..714
   PRODOM               PD193158: CAFF_RIFPA                     729..753
   PRODOM               PD000007: CA14(90) CA24(88) CA13(40)     755..792
   PRODOM               PD000007: CA14(90) CA24(88) CA13(40)     838..867
   PRODOM               PD000007: CA14(90) CA24(88) CA13(40)     888..919
   PRODOM               PD193594: CAFF_RIFPA                     921..959
   PRODOM               PD000007: CA14(90) CA24(88) CA13(40)     961..991
__________________


  Minus Strand HSPs:

 Score = 70 (24.6 bits), Expect = 0.17, Sum P(2) = 0.15
 Identities = 17/36 (47%), Positives = 17/36 (47%), Frame = -1

Query:   248 NLPPDWGCIPKQPDSPTAPRGATGS-GHDGALTLAG 144
             N  PD G  P  P  P  PRG TG  G DG   L G
Sbjct:   116 NQGPDGGPGPAGPSGPIGPRGQTGERGRDGKSGLPG 151

 Score = 39 (13.7 bits), Expect = 0.17, Sum P(2) = 0.15
 Identities = 10/24 (41%), Positives = 12/24 (50%), Frame = -3

Query:   135 PGDLGPVRH*GRXSRLQFGRRGRP 64
             PGD+G   H G       G+RG P
Sbjct:   285 PGDVGAPGHAGEA-----GKRGSP 303

 Score = 34 (12.0 bits), Expect = 0.54, Sum P(2) = 0.42
 Identities = 7/11 (63%), Positives = 7/11 (63%), Frame = -3

Query:   150 GWRPLPGDLGP 118
             G R LPG  GP
Sbjct:   883 GQRGLPGAAGP 893


to_Entrezto_Relatedto_Related >gi|2119156|pir||S28774  collagen alpha chain - tube worm (Riftia pachyptila)
            (fragment)
            Length = 1027

Frame -1 hits (HSPs):        ___                                          
Frame -3 hits (HSPs):                __                           __      
Annotated Domains:      __________________________________________________
                        __________________________________________________
Database sequence:     |       |      |      |       |      |      |      | 1027
                       0     150    300    450     600    750    900
__________________

Annotated Domains:
   Entrez               domain: amino-terminal telopeptide (frag 1..12
   Entrez               domain: collagenous                      13..1023
   Entrez               domain: carboxyl-terminal telopeptide (f 1024..1027
   Entrez               modified site: 4-hydroxyproline (Pro) (p 21
   Entrez               modified site: 4-hydroxyproline (Pro) (p 24
   Entrez               modified site: 4-hydroxyproline (Pro) (p 123
   Entrez               modified site: 4-hydroxyproline (Pro) (p 243
   Entrez               modified site: 4-hydroxyproline (Pro) (p 273
   Entrez               modified site: 4-hydroxyproline (Pro) (p 276
   Entrez               modified site: 4-hydroxyproline (Pro) (p 285
   Entrez               modified site: 4-hydroxyproline (Pro) (p 291
   Entrez               modified site: 4-hydroxyproline (Pro) (p 303
   Entrez               modified site: 4-hydroxyproline (Pro) (p 348
   Entrez               modified site: 4-hydroxyproline (Pro) (p 381
   Entrez               modified site: 4-hydroxyproline (Pro) (p 621
   Entrez               modified site: 4-hydroxyproline (Pro) (p 645
   Entrez               modified site: 4-hydroxyproline (Pro)    27
   Entrez               modified site: 4-hydroxyproline (Pro)    39
   Entrez               modified site: 4-hydroxyproline (Pro)    54
   Entrez               modified site: 4-hydroxyproline (Pro)    72
   Entrez               modified site: 4-hydroxyproline (Pro)    90
   Entrez               modified site: 4-hydroxyproline (Pro)    93
   Entrez               modified site: 4-hydroxyproline (Pro)    128
   Entrez               modified site: 4-hydroxyproline (Pro)    150
   Entrez               modified site: 4-hydroxyproline (Pro)    162
   Entrez               modified site: 4-hydroxyproline (Pro)    165
   Entrez               modified site: 4-hydroxyproline (Pro)    174
   Entrez               modified site: 4-hydroxyproline (Pro)    177
   Entrez               modified site: 4-hydroxyproline (Pro)    180
   Entrez               modified site: 4-hydroxyproline (Pro)    207
   Entrez               modified site: 4-hydroxyproline (Pro)    216
   Entrez               modified site: 4-hydroxyproline (Pro)    219
   Entrez               modified site: 4-hydroxyproline (Pro)    228
   Entrez               modified site: 4-hydroxyproline (Pro)    237
   Entrez               modified site: 4-hydroxyproline (Pro)    249
   Entrez               modified site: 4-hydroxyproline (Pro)    255
   Entrez               modified site: 4-hydroxyproline (Pro)    306
   Entrez               modified site: 4-hydroxyproline (Pro)    312
   Entrez               modified site: 4-hydroxyproline (Pro)    321
   Entrez               modified site: 4-hydroxyproline (Pro)    327
   Entrez               modified site: 4-hydroxyproline (Pro)    339
   Entrez               modified site: 4-hydroxyproline (Pro)    366
   Entrez               modified site: 4-hydroxyproline (Pro)    372
   Entrez               modified site: 4-hydroxyproline (Pro)    375
   Entrez               modified site: 4-hydroxyproline (Pro)    387
   Entrez               modified site: 4-hydroxyproline (Pro)    417
   Entrez               modified site: 4-hydroxyproline (Pro)    423
   Entrez               modified site: 4-hydroxyproline (Pro)    429
   Entrez               modified site: 4-hydroxyproline (Pro)    432
   Entrez               modified site: 4-hydroxyproline (Pro)    453
   Entrez               modified site: 4-hydroxyproline (Pro)    465
   Entrez               modified site: 4-hydroxyproline (Pro)    483
   Entrez               modified site: 4-hydroxyproline (Pro)    500
   Entrez               modified site: 4-hydroxyproline (Pro)    503
   Entrez               modified site: 4-hydroxyproline (Pro)    506
   Entrez               modified site: 4-hydroxyproline (Pro)    513
   Entrez               modified site: 4-hydroxyproline (Pro)    525
   Entrez               modified site: 4-hydroxyproline (Pro)    533
   Entrez               modified site: 4-hydroxyproline (Pro)    536
   Entrez               modified site: 4-hydroxyproline (Pro)    540
   Entrez               modified site: 4-hydroxyproline (Pro)    552
   Entrez               modified site: 4-hydroxyproline (Pro)    561
   Entrez               modified site: 4-hydroxyproline (Pro)    603
   Entrez               modified site: 4-hydroxyproline (Pro)    627
   Entrez               modified site: 4-hydroxyproline (Pro)    648
   Entrez               modified site: 4-hydroxyproline (Pro)    663
   Entrez               modified site: 4-hydroxyproline (Pro)    708
   Entrez               modified site: 4-hydroxyproline (Pro)    711
   Entrez               modified site: 4-hydroxyproline (Pro)    714
   Entrez               modified site: 4-hydroxyproline (Pro)    717
   Entrez               modified site: 4-hydroxyproline (Pro)    723
   Entrez               modified site: 4-hydroxyproline (Pro)    744
   Entrez               modified site: 4-hydroxyproline (Pro)    759
   Entrez               modified site: 4-hydroxyproline (Pro)    774
   Entrez               modified site: 4-hydroxyproline (Pro)    783
   Entrez               modified site: 4-hydroxyproline (Pro)    792
   Entrez               modified site: 4-hydroxyproline (Pro)    816
   Entrez               modified site: 4-hydroxyproline (Pro)    843
   Entrez               modified site: 4-hydroxyproline (Pro)    849
   Entrez               modified site: 4-hydroxyproline (Pro)    855
   Entrez               modified site: 4-hydroxyproline (Pro)    861
   Entrez               modified site: 4-hydroxyproline (Pro)    867
   Entrez               modified site: 4-hydroxyproline (Pro)    888
   Entrez               modified site: 4-hydroxyproline (Pro)    894
   Entrez               modified site: 4-hydroxyproline (Pro)    915
   Entrez               modified site: 4-hydroxyproline (Pro)    945
   Entrez               modified site: 4-hydroxyproline (Pro)    954
   Entrez               modified site: 4-hydroxyproline (Pro)    963
   Entrez               modified site: 4-hydroxyproline (Pro)    966
   Entrez               modified site: 4-hydroxyproline (Pro)    984
   Entrez               modified site: 4-hydroxyproline (Pro)    990
   Entrez               modified site: 4-hydroxyproline (Pro)    1011
   Entrez               modified site: 4-hydroxyproline (Pro)    1014
   Entrez               modified site: 4-hydroxyproline (Pro)    1017
   Entrez               modified site: 4-hydroxyproline (Pro)    1020
   Entrez               modified site: 3-hydroxyproline (Pro)    53
   Entrez               modified site: 3-hydroxyproline (Pro)    161
   Entrez               modified site: 3-hydroxyproline (Pro)    165
   Entrez               modified site: 3-hydroxyproline (Pro)    416
   Entrez               modified site: 3-hydroxyproline (Pro)    551
   Entrez               modified site: 3-hydroxyproline (Pro)    647
   Entrez               modified site: 3-hydroxyproline (Pro)    773
   Entrez               modified site: 3-hydroxyproline (Pro)    815
   Entrez               modified site: 3-hydroxyproline (Pro)    1010
   Entrez               modified site: 3-hydroxyproline (Pro)    1013
   Entrez               modified site: 3-hydroxyproline (Pro)    1016
   Entrez               modified site: 3-hydroxyproline (Pro)    1019
   Entrez               modified site: 5-hydroxylysine (Lys)     96
   Entrez               modified site: 5-hydroxylysine (Lys)     108
   Entrez               modified site: 5-hydroxylysine (Lys)     192
   Entrez               modified site: 5-hydroxylysine (Lys)     261
   Entrez               modified site: 5-hydroxylysine (Lys)     279
   Entrez               modified site: 5-hydroxylysine (Lys)     573
   Entrez               modified site: 5-hydroxylysine (Lys)     612
   Entrez               modified site: 5-hydroxylysine (Lys)     657
   Entrez               modified site: 5-hydroxylysine (Lys)     738
   Entrez               modified site: 5-hydroxylysine (Lys)     765
   Entrez               modified site: 5-hydroxylysine (Lys)     810
   Entrez               modified site: 5-hydroxylysine (Lys)     927
   Entrez               modified site: 5-hydroxylysine (Lys)     936
   Entrez               binding site: carbohydrate (Lys) (covale 96
   Entrez               binding site: carbohydrate (Lys) (covale 108
   Entrez               binding site: carbohydrate (Lys) (covale 192
   Entrez               binding site: carbohydrate (Lys) (covale 261
   Entrez               binding site: carbohydrate (Lys) (covale 279
   Entrez               binding site: carbohydrate (Lys) (covale 573
   Entrez               binding site: carbohydrate (Lys) (covale 612
   Entrez               binding site: carbohydrate (Lys) (covale 657
   Entrez               binding site: carbohydrate (Lys) (covale 738
   Entrez               binding site: carbohydrate (Lys) (covale 765
   Entrez               binding site: carbohydrate (Lys) (covale 810
   Entrez               binding site: carbohydrate (Lys) (covale 927
   Entrez               binding site: carbohydrate (Lys) (covale 936
   Entrez               modified site: 5-hydroxylysine (Lys)     183
   Entrez               modified site: 5-hydroxylysine (Lys)     342
   Entrez               modified site: 5-hydroxylysine (Lys)     546
   Entrez               modified site: 5-hydroxylysine (Lys)     567
   Entrez               modified site: 5-hydroxylysine (Lys)     939
   Entrez               modified site: 5-hydroxylysine (Lys) (pa 351
   Entrez               modified site: 5-hydroxylysine (Lys) (pa 933
__________________


  Minus Strand HSPs:

 Score = 70 (24.6 bits), Expect = 0.17, Sum P(2) = 0.15
 Identities = 17/36 (47%), Positives = 17/36 (47%), Frame = -1

Query:   248 NLPPDWGCIPKQPDSPTAPRGATGS-GHDGALTLAG 144
             N  PD G  P  P  P  PRG TG  G DG   L G
Sbjct:   116 NQGPDGGPGPAGPSGPIGPRGQTGERGRDGKSGLPG 151

 Score = 39 (13.7 bits), Expect = 0.17, Sum P(2) = 0.15
 Identities = 10/24 (41%), Positives = 12/24 (50%), Frame = -3

Query:   135 PGDLGPVRH*GRXSRLQFGRRGRP 64
             PGD+G   H G       G+RG P
Sbjct:   285 PGDVGAPGHAGEA-----GKRGSP 303

 Score = 34 (12.0 bits), Expect = 0.54, Sum P(2) = 0.42
 Identities = 7/11 (63%), Positives = 7/11 (63%), Frame = -3

Query:   150 GWRPLPGDLGP 118
             G R LPG  GP
Sbjct:   883 GQRGLPGAAGP 893


to_Entrezto_Relatedto_Related >gi|7499772|pir||T21314  hypothetical protein F23H12.4 - Caenorhabditis elegans
            >gi|3876307|emb|CAA98942.1| (Z74472) predicted using
            Genefinder~contains similarity to Pfam domain: PF01391 (Collagen
            triple helix repeat (20 copies)), Score=84.3, E-value=8.1e-22, N=2;
            PF01484 (Nematode cuticle collagen N-terminal domain), Score=34.2,
            E-value=9.9e-07, N=1~cDNA E>
            Length = 301

Frame  1 hits (HSPs):                                  _________________  
                        __________________________________________________
Database sequence:     |        |       |       |        |       |       || 301
                       0       50     100     150      200     250     300

  Plus Strand HSPs:

 Score = 79 (27.8 bits), Expect = 0.18, P = 0.17
 Identities = 31/94 (32%), Positives = 36/94 (38%), Frame = +1

Query:    19 PLVTAXEPGKXHHENRSPSASE-L*SGXASSVADRAQVPWKGAPARVRAPSCPDPVAPRG 195
             P   A  PG    E  +P+ SE L  G      D       G P        P P  P+G
Sbjct:   193 PAGEAGAPGPAG-EPGTPAISEPLTPGAPGEPGDSGPPGPPGPPGAPGNDGPPGPPGPKG 251

Query:   196 AVGESGCLGMQPQSG--GKFRPRLNTGERPIANKY 294
             A G  G  G+  QSG  G   P    GE+ I  KY
Sbjct:   252 APGPDGPPGVDGQSGPPGPPGPAGTPGEKGICPKY 286


to_Entrezto_Related >gi|476420|pir||CGBO1S  collagen alpha 1(I) chain - bovine (tentative sequence)
            (fragments)
            Length = 779

Frame  1 hits (HSPs):                                       _____         
Frame -1 hits (HSPs):                  ___                                
Frame -3 hits (HSPs):                                                ___  
Annotated Domains:      _                                                 
                        __________________________________________________
Database sequence:     |         |         |        |         |         | | 779
                       0       150       300      450       600       750
__________________

Annotated Domains:
   Entrez               modified site: pyrrolidone carboxylic ac 1
__________________


  Plus Strand HSPs:

 Score = 78 (27.5 bits), Expect = 0.81, P = 0.56
 Identities = 23/65 (35%), Positives = 28/65 (43%), Frame = +1

Query:    31 AXEPGKXHHENRSPSA--SEL*SGXASSVADRAQVPWKGAPARVRAPSCPDPVAPRGAVG 204
             A  PG+   E  +P A  S    G   +  DR +    GAP    AP  P PV P G  G
Sbjct:   568 AGPPGESGREG-APGAEGSPGRDGSPGAKGDRGETGPAGAPGPPGAPGAPGPVGPAGKSG 626

Query:   205 ESGCLG 222
             + G  G
Sbjct:   627 DRGETG 632

  Minus Strand HSPs:

 Score = 71 (25.0 bits), Expect = 0.19, Sum P(2) = 0.17
 Identities = 18/38 (47%), Positives = 22/38 (57%), Frame = -1

Query:   251 RNLPPDWGCIPKQPDSPTAPRGATGS-GHDGALTLAGAP 138
             R  P + G   + P  P  PRGA G+ G+DGA   AGAP
Sbjct:   244 RGFPGERGV--EGPPGPAGPRGANGAPGNDGAKGDAGAP 280

 Score = 35 (12.3 bits), Expect = 0.19, Sum P(2) = 0.17
 Identities = 11/29 (37%), Positives = 12/29 (41%), Frame = -3

Query:   150 GWRPLPGDLGPVRH*GRXSRLQFGRRGRP 64
             G   LPG +GP    GR      G  G P
Sbjct:   716 GLNGLPGPIGPPGPRGRTG--DAGPAGPP 742


to_Entrezto_Relatedto_Related >gi|7441988|pir||T16984  transcription factor homolog BTF3 - curled-leaved
            tobacco >gi|1666173|emb|CAA70323.1| (Y09106) transcription factor
            [Nicotiana plumbaginifolia]
            Length = 165

Frame  3 hits (HSPs):                           ____________________      
                        __________________________________________________
Database sequence:     |              |               |              |    | 165
                       0             50             100            150

  Plus Strand HSPs:

 Score = 74 (26.0 bits), Expect = 0.20, P = 0.19
 Identities = 21/65 (32%), Positives = 30/65 (46%), Frame = +3

Query:   102 VLSGGPGPSPLEGGASQGESPVVPGPCRTTRRCRRVGLFGNAAPIGR*IPSKAKYWRETD 281
             V+SG P    L+G +S   SPV P    + R   R  +  + AP     P  A   +E D
Sbjct:    83 VVSGSPQTKKLQGYSSSNYSPVGPDNLESLREASRA-VPESRAPSANGAPEGAPALQEDD 141

Query:   282 SEQVP 296
              ++VP
Sbjct:   142 DDEVP 146


to_Entrezto_Relatedto_Related >gi|7516580|pir||C72637  hypothetical protein APE1554 - Aeropyrum pernix (strain
            K1) >gi|5105239|dbj|BAA80553.1| (AP000061) 104aa long hypothetical
            protein [Aeropyrum pernix]
            Length = 104

Frame -2 hits (HSPs):           _________________________________         
                        __________________________________________________
Database sequence:     |         |        |         |        |         |  | 104
                       0        20       40        60       80       100

  Minus Strand HSPs:

 Score = 71 (25.0 bits), Expect = 0.22, P = 0.20
 Identities = 21/67 (31%), Positives = 34/67 (50%), Frame = -2

Query:   220 PNNPTRRQRLVVRQGPGTTGLS-PWLAPPSRGLGPGPPLR---------TXLQTTIRTPR 71
             P +  RR+R  +R G    G   P++  P  G GPGPP R         + L++  ++P 
Sbjct:    19 PLHRLRRERRGLRAGAPWGGTPRPFVLRPDTGGGPGPPPRGGLRLPGHVSLLRSRSQSPP 78

Query:    70 ATDFHGG 50
             + ++HGG
Sbjct:    79 SHEYHGG 85


to_Entrezto_Related >gi|71405|pir||CGCH1S  collagen alpha 1(I) chain - chicken (tentative sequence)
            (fragments)
            Length = 1042

Frame  1 hits (HSPs):                                           ____      
Frame -1 hits (HSPs):                           ___                       
Frame -3 hits (HSPs):                                                  __ 
Annotated Domains:      _                                                 
                        __________________________________________________
Database sequence:     |       |      |      |      |      |       |      | 1042
                       0     150    300    450    600    750     900
__________________

Annotated Domains:
   Entrez               modified site: pyrrolidone carboxylic ac 1
__________________


  Plus Strand HSPs:

 Score = 75 (26.4 bits), Expect = 2.4, P = 0.91
 Identities = 23/71 (32%), Positives = 28/71 (39%), Frame = +1

Query:    31 AXEPGKXHHENRSPSASEL*S--GXASSVADRAQVPWKGAPARVRAPSCPDPVAPRGAVG 204
             A  PG+   E  +P A       G A    DR +    G P    AP  P PV P G  G
Sbjct:   844 AGPPGEAGREG-APGAEGAPGRDGAAGPKGDRGETGPAGPPGAPGAPGAPGPVGPAGKNG 902

Query:   205 ESGCLGMQPQSG 240
             + G  G    +G
Sbjct:   903 DRGETGPAGPAG 914

  Minus Strand HSPs:

 Score = 70 (24.6 bits), Expect = 0.28, Sum P(2) = 0.24
 Identities = 18/38 (47%), Positives = 22/38 (57%), Frame = -1

Query:   251 RNLPPDWGCIPKQPDSPTAPRGATGS-GHDGALTLAGAP 138
             R  P + G   + P  P  PRGA G+ G+DGA   AGAP
Sbjct:   517 RGFPGERGV--QGPPGPQGPRGANGAPGNDGAKGDAGAP 553

 Score = 37 (13.0 bits), Expect = 0.28, Sum P(2) = 0.24
 Identities = 11/30 (36%), Positives = 13/30 (43%), Frame = -3

Query:   150 GWRPLPGDLGPVRH*GRXSRL-QFGRRGRP 64
             G   LPG +GP    GR   +   G  G P
Sbjct:   992 GLNGLPGPIGPPGPRGRTGEVGPVGPPGPP 1021


to_Entrezto_Relatedto_Related >gi|115397|sp|P08124|CC01_CAEEL  CUTICLE COLLAGEN 1 >gi|84425|pir||A31219
            collagen 1 - Caenorhabditis elegans >gi|6678|emb|CAA23463.1|
            (V00147) unnamed protein product [Caenorhabditis elegans]
            >gi|156258|gb|AAA27988.1| (J01047) collagen [Caenorhabditis
            elegans]
            Length = 296

Frame  1 hits (HSPs):                                  _________________  
Annotated Domains:      __________________________________________________
                        __________________________________________________
Database sequence:     |        |       |        |       |        |       | 296
                       0       50     100      150     200      250
__________________

Annotated Domains:
   DOMO                 DM00019: FIBRILLARCOLLAGENCARBOXYL-TERMI 103..269
   Entrez               Domain: TRIPLE-HELICAL REGION.           100..129
   Entrez               Domain: TRIPLE-HELICAL REGION.           148..174
   Entrez               Domain: TRIPLE-HELICAL REGION.           178..204
   Entrez               Domain: TRIPLE-HELICAL REGION.           213..278
   PFAM                 Collagen: Collagen triple helix repeat ( 148..207
   PFAM                 Collagen: Collagen triple helix repeat ( 213..272
   PRODOM               PD004226:                                6..32
   PRODOM               PD000926:                                34..97
   PRODOM               PD000540: H1(18) O76786(11) TONB(10)     100..232
   PRODOM               PD002391:                                276..295
__________________


  Plus Strand HSPs:

 Score = 77 (27.1 bits), Expect = 0.29, P = 0.25
 Identities = 31/94 (32%), Positives = 35/94 (37%), Frame = +1

Query:    19 PLVTAXEPGKXHHENRSPSASE-L*SGXASSVADRAQVPWKGAPARVRAPSCPDPVAPRG 195
             P   A  PG    E  +P+ SE L  G      D       G P        P P  P+G
Sbjct:   188 PAGEAGAPGPAG-EPGTPAISEPLTPGAPGEPGDSGPPGPPGPPGAPGNDGPPGPPGPKG 246

Query:   196 AVGESGCLGMQPQSG--GKFRPRLNTGERPIANKY 294
             A G  G  G   QSG  G   P    GE+ I  KY
Sbjct:   247 APGPDGPPGADGQSGPPGPPGPAGTPGEKGICPKY 281


to_Entrezto_Relatedto_Related >gi|180882|gb|AAA52053.1|  (M21353) alpha-2 type I collagen [Homo sapiens]
            Length = 54

Frame  3 hits (HSPs):              __________________                     
Frame  1 hits (HSPs):                                      _____________  
                        __________________________________________________
Database sequence:     |                 |                  |             | 54
                       0                20                 40

  Plus Strand HSPs:

 Score = 42 (14.8 bits), Expect = 0.32, Sum P(2) = 0.27
 Identities = 8/21 (38%), Positives = 12/21 (57%), Frame = +3

Query:   111 GGPGPSPLEGGASQGESPVVP 173
             G PGP  ++GG  + + P  P
Sbjct:    13 GPPGPQGVQGGKGE-QGPAGP 32

 Score = 38 (13.4 bits), Expect = 0.32, Sum P(2) = 0.27
 Identities = 7/14 (50%), Positives = 8/14 (57%), Frame = +1

Query:   172 PDPVAPRGAVGESG 213
             P P  P G VG+ G
Sbjct:    39 PGPSGPAGEVGKPG 52


to_Entrezto_Relatedto_Related >gi|5732934|gb|AAD49346.1|  (AF169346) pro-alpha-1 type 1 collagen [Cavia
            porcellus]
            Length = 230

Frame  1 hits (HSPs):                                ________________     
                        __________________________________________________
Database sequence:     |          |          |          |          |      | 230
                       0         50        100        150        200

  Plus Strand HSPs:

 Score = 75 (26.4 bits), Expect = 0.32, P = 0.27
 Identities = 24/71 (33%), Positives = 28/71 (39%), Frame = +1

Query:    31 AXEPGKXHHENRSPSA--SEL*SGXASSVADRAQVPWKGAPARVRAPSCPDPVAPRGAVG 204
             A  PG+   E  SP A  S    G      DR +    G P    AP  P PV P G  G
Sbjct:   137 AGPPGESGREG-SPGAEGSPGRDGSPGPKGDRGETGPAGPPGAPGAPGAPGPVGPAGKNG 195

Query:   205 ESGCLGMQPQSG 240
             + G  G    +G
Sbjct:   196 DRGETGPAGPAG 207


to_Entrezto_Relatedto_Related >gi|115411|sp|P17657|CCDC_CAEEL  CUTICLE COLLAGEN DPY-13 >gi|84436|pir||A31921
            collagen dpy-13 precursor - Caenorhabditis elegans
            >gi|156270|gb|AAA27994.1| (M23559) collagen [Caenorhabditis
            elegans] >gi|1123099|gb|AAA83499.1| (U42437) coded for by C.
            elegans cDNA yk100c3.5; coded for by C. elegans cDNA yk58f10.5;
            coded for by C. elegans cDNA yk66g4.5; coded for by C. elegans cDNA
            cm9g8; coded for by C. elegans cDNA yk66g4.3; coded for by C.
            elegans cDNA yk58f10.3; coded for>
            Length = 302

Frame  2 hits (HSPs):                    ____                             
Frame  1 hits (HSPs):                                          _________  
Annotated Domains:      __________________________________________________
                        __________________________________________________
Database sequence:     |        |       |       |       |        |       || 302
                       0       50     100     150     200      250     300
__________________

Annotated Domains:
   DOMO                 DM00019: FIBRILLARCOLLAGENCARBOXYL-TERMI 109..275
   Entrez               Domain: TRIPLE-HELICAL REGION.           106..135
   Entrez               Domain: TRIPLE-HELICAL REGION.           154..210
   Entrez               Domain: TRIPLE-HELICAL REGION.           219..278
   PFAM                 Collagen: Collagen triple helix repeat ( 154..213
   PFAM                 Collagen: Collagen triple helix repeat ( 219..278
   PRODOM               PD004226:                                6..32
   PRODOM               PD000926:                                34..104
   PRODOM               PD014680:                                119..143
   PRODOM               PD000007: CA14(90) CA24(88) CA13(40)     174..204
   PRODOM               PD196428: CCDC_CAEEL                     206..280
   PRODOM               PD002391:                                282..301
__________________


  Plus Strand HSPs:

 Score = 67 (23.6 bits), Expect = 0.35, Sum P(2) = 0.30
 Identities = 19/52 (36%), Positives = 20/52 (38%), Frame = +1

Query:   139 GAPARVRAPSCPDPVAPRGAVGESGCLGMQPQSGGKFRPRLNTGERPIANKY 294
             G P     P  P P  P G  G+ G  G  P   G   P    GER I  KY
Sbjct:   237 GQPGNDGTPGQPGPKGPPGPDGKPGADG-NPGQPGPVGPPGTPGERGICPKY 287

 Score = 38 (13.4 bits), Expect = 0.35, Sum P(2) = 0.30
 Identities = 9/18 (50%), Positives = 10/18 (55%), Frame = +2

Query:   104 PQWRTG-PKSPGRGRQPG 154
             PQ   G P  PGR  +PG
Sbjct:   107 PQGAPGAPGKPGRPGKPG 124


to_Entrezto_Relatedto_Related >gi|7504118|pir||T22607  hypothetical protein F54B11.1 - Caenorhabditis elegans
            >gi|3877568|emb|CAA94141.1| (Z70208) contains similarity to Pfam
            domain: PF01391 (Collagen triple helix repeat (20 copies)),
            Score=25.2, E-value=5e-05, N=2; PF01484 (Nematode cuticle collagen
            N-terminal domain), Score=-2.7, E-value=0.59, N=1 [Caenorhabditis
            elegans]
            Length = 339

Frame  3 hits (HSPs):                                _______   ____       
Frame  1 hits (HSPs):                                            _______  
                        __________________________________________________
Database sequence:     |                     |                      |     | 339
                       0                   150                    300

  Plus Strand HSPs:

 Score = 65 (22.9 bits), Expect = 0.42, Sum P(2) = 0.34
 Identities = 17/44 (38%), Positives = 20/44 (45%), Frame = +1

Query:   163 PSCPDPVAPRGAVGESGCLGMQPQSGGKFRPRLNTGERPIANKY 294
             P  P P  P G  G  G  G+  QSGG  R     GE+ +  KY
Sbjct:   284 PGLPGPPGPPGRPGSDGNPGVPGQSGGSRRH----GEKGVCPKY 323

 Score = 44 (15.5 bits), Expect = 0.42, Sum P(2) = 0.34
 Identities = 12/25 (48%), Positives = 15/25 (60%), Frame = +3

Query:   105 LSGGPG-PSPLEGGASQGESPVVPGP 179
             L+G PG P  + G   Q  S V+PGP
Sbjct:   198 LNGQPGLPGGM-GPPGQSMSNVLPGP 222

 Score = 37 (13.0 bits), Expect = 2.1, Sum P(2) = 0.88
 Identities = 10/25 (40%), Positives = 13/25 (52%), Frame = +3

Query:   105 LSGGPGPSPLEGGASQGESPVVPGP 179
             L G PG    + GA+    P +PGP
Sbjct:   266 LMGPPGLDA-QNGANGFGPPGLPGP 289

 Score = 33 (11.6 bits), Expect = 5.3, Sum P(2) = 0.99
 Identities = 8/22 (36%), Positives = 8/22 (36%), Frame = +3

Query:   111 GGPGPSPLEGGASQGESPVVPG 176
             G PGP    G       P  PG
Sbjct:   224 GPPGPPGQPGNHGLAGQPGSPG 245


to_Entrezto_Relatedto_Related >gi|83714|pir||JN0254  qutH protein - Emericella nidulans
            Length = 378

Frame  3 hits (HSPs):                              ____________           
Annotated Domains:                             ______                     
                        __________________________________________________
Database sequence:     |                   |                   |          | 378
                       0                 150                 300
__________________

Annotated Domains:
   Entrez               region: zinc binding                     181..213
__________________


  Plus Strand HSPs:

 Score = 77 (27.1 bits), Expect = 0.42, P = 0.34
 Identities = 25/83 (30%), Positives = 37/83 (44%), Frame = +3

Query:    18 SLSNGXRTGKXPP*KSVALGVRIV--VWRXVLSGGPGPSPLEGGASQGESPVVP-GPCRT 188
             S S   RT   PP K+V L +R    +    L     PSPL      GE+P +P  P ++
Sbjct:   209 SCSACARTRNIPPRKAVLLTLRFASGIVGTFLYCDATPSPLNFETGTGENPTIPPAPSKS 268

Query:   189 TRRCRRVGLFGNAAPIGR*IPSKAKY 266
                C R+   G  A +   +P   ++
Sbjct:   269 ASECYRI--LGTRASLS--VPDMTRW 290


to_Entrezto_Relatedto_Related >gi|192264|gb|AAA37334.1|  (M17491) procollagen type I alpha chain [Mus
            musculus]
            Length = 396

Frame  1 hits (HSPs):                                   __________        
                        __________________________________________________
Database sequence:     |                  |                  |            | 396
                       0                150                300

  Plus Strand HSPs:

 Score = 77 (27.1 bits), Expect = 0.45, P = 0.36
 Identities = 24/71 (33%), Positives = 29/71 (40%), Frame = +1

Query:    31 AXEPGKXHHENRSPSA--SEL*SGXASSVADRAQVPWKGAPARVRAPSCPDPVAPRGAVG 204
             A  PG+   E  SP A  S    G   +  DR +    G P    AP  P PV P G  G
Sbjct:   261 AGPPGESGREG-SPGAEGSPGRDGAPGAKGDRGETGPAGPPGAPGAPGAPGPVGPAGKNG 319

Query:   205 ESGCLGMQPQSG 240
             + G  G    +G
Sbjct:   320 DRGETGPAGPAG 331


to_Entrezto_Relatedto_Related >gi|3288493|emb|CAA75878.1|  (Y15918) COL1A1 and PDGFB fusion transcript [Homo
            sapiens]
            Length = 173

Frame -1 hits (HSPs):                          ___________                
                        __________________________________________________
Database sequence:     |              |             |              |      | 173
                       0             50           100            150

  Minus Strand HSPs:

 Score = 71 (25.0 bits), Expect = 0.51, P = 0.40
 Identities = 18/38 (47%), Positives = 22/38 (57%), Frame = -1

Query:   251 RNLPPDWGCIPKQPDSPTAPRGATGS-GHDGALTLAGAP 138
             R  P + G   + P  P  PRGA G+ G+DGA   AGAP
Sbjct:    81 RGFPGERGV--QGPPGPAGPRGANGAPGNDGAKGDAGAP 117


to_Entrezto_Relatedto_Related >gi|10440430|dbj|BAB15748.1|  (AK024458) FLJ00050 protein [Homo sapiens]
            Length = 270

Frame -2 hits (HSPs):                                 _______________     
                        __________________________________________________
Database sequence:     |         |        |        |        |         |   | 270
                       0        50      100      150      200       250

  Minus Strand HSPs:

 Score = 74 (26.0 bits), Expect = 0.54, P = 0.42
 Identities = 28/80 (35%), Positives = 35/80 (43%), Frame = -2

Query:   262 LALDGIYRPIGAAFPNNPTRRQRLVVRQGPGTTGLSPWLAP-PSRGLGPGPPLRTXLQTT 86
             LAL      +    P+ P  +  LV+ Q PG TGLSPW  P PS     G P     Q  
Sbjct:   165 LALPSPPAQLQGLMPSAPQDKS-LVLPQ-PGLTGLSPWRRPRPSST--KGLPQNPGQQAA 220

Query:    85 IRTPRATDFHGGXFPVRSPLL 23
             +   +        FPVRS L+
Sbjct:   221 LWVAQRIKMWPPCFPVRSGLV 241


to_Entrezto_Relatedto_Related >gi|115268|sp|P02457|CA11_CHICK  COLLAGEN ALPHA 1(I) CHAIN PRECURSOR
            Length = 1453

Frame  1 hits (HSPs):                                     ___             
Frame -1 hits (HSPs):                         ___                         
Frame -3 hits (HSPs):                                          __         
Annotated Domains:      __________________________________________________
                        __________________________________________________
Database sequence:     |                 |                |               | 1453
                       0               500             1000
__________________

Annotated Domains:
   BLOCKS               BL01208C: VWFC domain proteins.          58..68
   BLOCKS               BL01208B: VWFC domain proteins.          74..88
   BLOCKS               BL01208A: VWFC domain proteins.          1385..1391
   DOMO                 DM00551: VONWILLEBRANDFACTORTYPECREPEAT  1..97
   DOMO                 DM00042: FIBRILLARCOLLAGENCARBOXYL-TERMI 98..141
   DOMO                 DM00019: FIBRILLARCOLLAGENCARBOXYL-TERMI 142..336
   DOMO                 DM00019: FIBRILLARCOLLAGENCARBOXYL-TERMI 338..478
   DOMO                 DM00019: FIBRILLARCOLLAGENCARBOXYL-TERMI 480..668
   DOMO                 DM00042: FIBRILLARCOLLAGENCARBOXYL-TERMI 670..849
   DOMO                 DM00019: FIBRILLARCOLLAGENCARBOXYL-TERMI 915..1139
   DOMO                 DM01418: FIBRILLARCOLLAGENCARBOXYL-TERMI 1273..1307
   DOMO                 DM00860: FIBRILLARCOLLAGENCARBOXYL-TERMI 1309..1452
   Entrez               Domain: VWFC.                            31..89
   Entrez               pyrrolidone-carboxylic-acid site         152
   Entrez               hydroxylation site: (POTENTIAL).         254
   Entrez               hydroxylation site: (POTENTIAL).         851
   Entrez               hydroxylation site: (POTENTIAL).         1081
   Entrez               hydroxylation site: (POTENTIAL).         1097
   Entrez               hydroxylation site: (ONLY 3-HYDROXYPRO A 1153
   PFAM                 vwc: von Willebrand factor type C domain 33..88
   PFAM                 Collagen: Collagen triple helix repeat ( 100..158
   PFAM                 Collagen: Collagen triple helix repeat ( 166..224
   PFAM                 Collagen: Collagen triple helix repeat ( 225..284
   PFAM                 Collagen: Collagen triple helix repeat ( 285..344
   PFAM                 Collagen: Collagen triple helix repeat ( 345..404
   PFAM                 Collagen: Collagen triple helix repeat ( 405..464
   PFAM                 Collagen: Collagen triple helix repeat ( 465..524
   PFAM                 Collagen: Collagen triple helix repeat ( 525..584
   PFAM                 Collagen: Collagen triple helix repeat ( 585..644
   PFAM                 Collagen: Collagen triple helix repeat ( 645..704
   PFAM                 Collagen: Collagen triple helix repeat ( 705..764
   PFAM                 Collagen: Collagen triple helix repeat ( 768..827
   PFAM                 Collagen: Collagen triple helix repeat ( 828..887
   PFAM                 Collagen: Collagen triple helix repeat ( 888..947
   PFAM                 Collagen: Collagen triple helix repeat ( 948..1007
   PFAM                 Collagen: Collagen triple helix repeat ( 1008..1067
   PFAM                 Collagen: Collagen triple helix repeat ( 1068..1127
   PFAM                 Collagen: Collagen triple helix repeat ( 1128..1187
   PFAM                 COLFI: Fibrillar collagen C-terminal dom 1234..1452
   PRODOM               PD162416: CA11_CHICK                     1..24
   PRODOM               PD000826: NEL(12) Q17429(6) NOV(5)       26..104
   PRODOM               PD058205: CA11_CHICK                     106..124
   PRODOM               PD026094: CA11_CHICK                     148..175
   PRODOM               PD000007: CA14(90) CA24(88) CA13(40)     177..207
   PRODOM               PD018836: CA11(3)                        241..267
   PRODOM               PD000007: CA14(90) CA24(88) CA13(40)     269..303
   PRODOM               PD000540: H1(18) O76786(11) TONB(10)     306..472
   PRODOM               PD000540: H1(18) O76786(11) TONB(10)     491..663
   PRODOM               PD000007: CA14(90) CA24(88) CA13(40)     683..713
   PRODOM               PD000540: H1(18) O76786(11) TONB(10)     735..852
   PRODOM               PD000007: CA14(90) CA24(88) CA13(40)     856..888
   PRODOM               PD187138: CA11(3) O76045(1)              929..960
   PRODOM               PD000007: CA14(90) CA24(88) CA13(40)     962..992
   PRODOM               PD002493: CA11(3) CA12(2)                1000..1027
   PRODOM               PD007903:                                1079..1102
   PRODOM               PD000007: CA14(90) CA24(88) CA13(40)     1104..1133
   PRODOM               PD038429: CA11(3) O76045(1) Q9YIB4(1)    1185..1232
   PRODOM               PD002078: CA11(3) CA12(2) CA13(2)        1234..1451
__________________


  Plus Strand HSPs:

 Score = 75 (26.4 bits), Expect = 3.4, P = 0.97
 Identities = 23/71 (32%), Positives = 28/71 (39%), Frame = +1

Query:    31 AXEPGKXHHENRSPSASEL*S--GXASSVADRAQVPWKGAPARVRAPSCPDPVAPRGAVG 204
             A  PG+   E  +P A       G A    DR +    G P    AP  P PV P G  G
Sbjct:   995 AGPPGEAGREG-APGAEGAPGRDGAAGPKGDRGETGPAGPPGAPGAPGAPGPVGPAGKNG 1053

Query:   205 ESGCLGMQPQSG 240
             + G  G    +G
Sbjct:  1054 DRGETGPAGPAG 1065

  Minus Strand HSPs:

 Score = 70 (24.6 bits), Expect = 0.55, Sum P(2) = 0.42
 Identities = 18/38 (47%), Positives = 22/38 (57%), Frame = -1

Query:   251 RNLPPDWGCIPKQPDSPTAPRGATGS-GHDGALTLAGAP 138
             R  P + G   + P  P  PRGA G+ G+DGA   AGAP
Sbjct:   668 RGFPGERGV--QGPPGPQGPRGANGAPGNDGAKGDAGAP 704

 Score = 37 (13.0 bits), Expect = 0.55, Sum P(2) = 0.42
 Identities = 11/30 (36%), Positives = 13/30 (43%), Frame = -3

Query:   150 GWRPLPGDLGPVRH*GRXSRL-QFGRRGRP 64
             G   LPG +GP    GR   +   G  G P
Sbjct:  1143 GLNGLPGPIGPPGPRGRTGEVGPVGPPGPP 1172


to_Entrezto_Relatedto_Related >gi|2136732|pir||I45876  collagen alpha 1(II) chain - bovine (fragment)
            >gi|457187|gb|AAA30436.1| (L28918) cyanogen bromide [Bos taurus]
            Length = 93

Frame  1 hits (HSPs):                                  __________________ 
                        __________________________________________________
Database sequence:     |          |         |          |          |       | 93
                       0         20        40         60         80

  Plus Strand HSPs:

 Score = 67 (23.6 bits), Expect = 0.59, P = 0.45
 Identities = 15/34 (44%), Positives = 18/34 (52%), Frame = +1

Query:   139 GAPARVRAPSCPDPVAPRGAVGESGCLGMQPQSG 240
             GAP    AP  P P  P G  G +G LG + Q+G
Sbjct:    59 GAPGIAGAPGFPGPRGPPGPQGATGPLGPKGQTG 92


to_Entrezto_Relatedto_Related >gi|159960|gb|AAA29439.1|  (M24558) collagen-like protein [Paracentrotus
            lividus]
            Length = 290

Frame  1 hits (HSPs):                     _____________                   
                        __________________________________________________
Database sequence:     |        |        |       |        |       |       | 290
                       0       50      100     150      200     250

  Plus Strand HSPs:

 Score = 74 (26.0 bits), Expect = 0.61, P = 0.45
 Identities = 23/72 (31%), Positives = 28/72 (38%), Frame = +1

Query:    40 PGKXHHEN-RSPSASEL*SGXASSVADRAQVPWKGAPARVRAPSCPDPVAPRGAVGESGC 216
             PG       R P  S   +G    V +R  +   G P    AP  P P   RG  GE G 
Sbjct:   108 PGSQGESGERGPRGSVGPAGPPGGVGERGPM---GPPGMSGAPGAPGPKGDRGLPGERGA 164

Query:   217 LGMQPQSGGKFRP 255
              G +  +G   RP
Sbjct:   165 NGPKGSAGESGRP 177


to_Entrezto_Relatedto_Related >gi|7510834|pir||T28999  hypothetical protein ZC513.8 - Caenorhabditis elegans
            >gi|1255433|gb|AAC48270.1| (U53155) Similar to cuticular collagen;
            coded for by C. elegans cDNA yk58e6.3; coded for by C. elegans cDNA
            yk71h12.3; coded for by C. elegans cDNA yk100d10.3; coded for by C.
            elegans cDNA yk100d4.3; coded for by C. elegans cDNA yk123g7.3;
            coded for by>
            Length = 303

Frame  1 hits (HSPs):                                   ________________  
                        __________________________________________________
Database sequence:     |        |       |       |       |        |       || 303
                       0       50     100     150     200      250     300

  Plus Strand HSPs:

 Score = 74 (26.0 bits), Expect = 0.65, P = 0.48
 Identities = 27/90 (30%), Positives = 37/90 (41%), Frame = +1

Query:    31 AXEPGKXHHENRSPSASE-L*SGXASSVADRAQVPWKGAPARVRAPSCPDPVAPRGAVGE 207
             A +PG    +  +P+ SE L  G      D       G P        P P  P+G+ G 
Sbjct:   199 AGQPGP-QGDAGTPAQSEPLTPGAPGEAGDAGPAGPPGPPGAPGNDGPPGPPGPKGSPGP 257

Query:   208 SGCLGMQPQSGGKFRP-RLNT-GERPIANKY 294
              G  G+  Q+G    P +  T GE+ I  KY
Sbjct:   258 DGPAGVDGQAGPPGPPGQAGTPGEKGICPKY 288


to_Entrezto_Relatedto_Related >gi|930045|emb|CAA33387.1|  (X15332) alpha-1 (III) collagen [Homo sapiens]
            Length = 1078

Frame  3 hits (HSPs):                                       ___           
Frame  2 hits (HSPs):                        __                           
Frame  1 hits (HSPs):                                               ____  
                        __________________________________________________
Database sequence:     |      |      |      |      |      |      |      | | 1078
                       0    150    300    450    600    750    900   1050

  Plus Strand HSPs:

 Score = 68 (23.9 bits), Expect = 0.70, Sum P(2) = 0.50
 Identities = 23/55 (41%), Positives = 24/55 (43%), Frame = +1

Query:   130 PWKGAPARVR-APSCPDPVAPRGAVGESGCLGMQPQSG-----GKFRPRLNTGER 276
             P    PA  + A   P P  PRG VG SG  G    SG     G   PR N GER
Sbjct:   971 PGSPGPAGQQGAIGSPGPAGPRGPVGPSGPPGKDGTSGHPGPIGPPGPRGNRGER 1025

 Score = 41 (14.4 bits), Expect = 0.70, Sum P(2) = 0.50
 Identities = 13/43 (30%), Positives = 17/43 (39%), Frame = +3

Query:    30 GXRTGKXPP*KSVALGVRIVVWRXVLSGGPG-PSPLEGGASQG 155
             G    + PP     LG+  +     L+G PG P P      QG
Sbjct:   786 GSPGAQGPPGAPGPLGIAGITGARGLAGPPGMPGPRGSPGPQG 828

 Score = 35 (12.3 bits), Expect = 6.0, Sum P(2) = 1.0
 Identities = 8/19 (42%), Positives = 9/19 (47%), Frame = +2

Query:    80 PNCSLEXRPQWRTGPKSPG 136
             P  + E  PQ   GP  PG
Sbjct:   461 PGKNGEYGPQGPPGPTGPG 479


to_Entrezto_Relatedto_Related >gi|553198|gb|AAA51816.1|  (M31731) bcl-3 protein [Homo sapiens]
            Length = 77

Frame  1 hits (HSPs):               ________________________              
                        __________________________________________________
Database sequence:     |            |            |            |           | 77
                       0           20           40           60

  Plus Strand HSPs:

 Score = 66 (23.2 bits), Expect = 0.75, P = 0.53
 Identities = 18/41 (43%), Positives = 24/41 (58%), Frame = +1

Query:   121 AQVPWKGAPARVRAPSCPDPVAPRGAVGESGCLGMQPQSGG 243
             A +P +  P  +RAPS P+P APRGA G    + + P  GG
Sbjct:    20 AALPLRKRP--LRAPS-PEPAAPRGAAGL--VVPLDPLRGG 55


to_Entrezto_Relatedto_Related >gi|5921192|sp|P02467|CA21_CHICK  COLLAGEN ALPHA 2(I) CHAIN PRECURSOR
            Length = 1362

Frame  2 hits (HSPs):       __                                            
Frame  1 hits (HSPs):                         ___                         
Annotated Domains:         _            _                                 
                        __________________________________________________
Database sequence:     |                  |                 |             | 1362
                       0                500              1000
__________________

Annotated Domains:
   Entrez               modified site: CONVERTED TO AN ALDEHYDE  83
   Entrez               hydroxylation site                       439
   Entrez               hydroxylation site                       442
__________________


  Plus Strand HSPs:

 Score = 69 (24.3 bits), Expect = 0.76, Sum P(2) = 0.53
 Identities = 18/47 (38%), Positives = 20/47 (42%), Frame = +1

Query:   136 KGAPARVRAPSCPDPVAPRGAVGESGCLGMQPQSGGKFRPRL--NTG 270
             KG P  V     P P  P G  GE G  G+    G K  P L  +TG
Sbjct:   619 KGEPGNVGPAGAPGPAGPGGIPGERGVAGVPGGKGEKGAPGLRGDTG 665

 Score = 36 (12.7 bits), Expect = 0.76, Sum P(2) = 0.53
 Identities = 10/31 (32%), Positives = 13/31 (41%), Frame = +2

Query:    65 GRPRRPNCSLEXRPQWRTGPKSP-GRGRQPG 154
             G P  P    +  PQ   GP  P G+  + G
Sbjct:   114 GVPGEPGEPGQTGPQGPRGPPGPPGKAGEDG 144


to_Entrezto_Relatedto_Related >gi|192262|gb|AAA37333.1|  (M14423) pro-alpha-1 type I collagen [Mus musculus]
            >gi|224870|prf||1202297A collagen alpha1(I),pro [Mus musculus]
            Length = 611

Frame  1 hits (HSPs):                                          ______     
                        __________________________________________________
Database sequence:     |            |           |           |            || 611
                       0          150         300         450          600

  Plus Strand HSPs:

 Score = 77 (27.1 bits), Expect = 0.78, P = 0.54
 Identities = 24/71 (33%), Positives = 29/71 (40%), Frame = +1

Query:    31 AXEPGKXHHENRSPSA--SEL*SGXASSVADRAQVPWKGAPARVRAPSCPDPVAPRGAVG 204
             A  PG+   E  SP A  S    G   +  DR +    G P    AP  P PV P G  G
Sbjct:   478 AGPPGESGREG-SPGAEGSPGRDGAPGAKGDRGETGPAGPPGAPGAPGAPGPVGPAGKNG 536

Query:   205 ESGCLGMQPQSG 240
             + G  G    +G
Sbjct:   537 DRGETGPAGPAG 548


to_Entrezto_Related >gi|8922952  ref|NP_060836.1| hypothetical protein FLJ11230 [Homo sapiens]
            >gi|7023765|dbj|BAA92080.1| (AK002092) unnamed protein product
            [Homo sapiens]
            Length = 217

Frame -1 hits (HSPs):    ____________                                     
                        __________________________________________________
Database sequence:     |           |          |           |          |    | 217
                       0          50        100         150        200

  Minus Strand HSPs:

 Score = 71 (25.0 bits), Expect = 0.80, P = 0.55
 Identities = 14/48 (29%), Positives = 22/48 (45%), Frame = -1

Query:   236 DWGCIPKQPDSPTAPRGATGSGHDGALTLAGAPFQGTWARSATEDASP 93
             D G +P+       P+GA  SG  G ++ + +   G W     ED +P
Sbjct:     7 DGGALPEGLAEEAEPQGAAWSGDSGTVSQSHSSASGPWEDEGAEDGAP 54


to_Entrezto_Relatedto_Related >gi|2143752|pir||I60384  gene T1 protein - rat (fragment)
            >gi|531376|emb|CAA56213.1| (X79816) T1 [Rattus norvegicus]
            Length = 53

Frame  2 hits (HSPs):      _________________                              
Frame  1 hits (HSPs):                                     ____________    
                        __________________________________________________
Database sequence:     |                 |                  |             | 53
                       0                20                 40

  Plus Strand HSPs:

 Score = 41 (14.4 bits), Expect = 0.85, Sum P(2) = 0.57
 Identities = 9/18 (50%), Positives = 11/18 (61%), Frame = +2

Query:   104 PQWRTGPKSP-GRGRQPG 154
             PQ  TGP  P G+  +PG
Sbjct:     5 PQGATGPLGPKGQTGEPG 22

 Score = 35 (12.3 bits), Expect = 0.85, Sum P(2) = 0.57
 Identities = 6/12 (50%), Positives = 8/12 (66%), Frame = +1

Query:   178 PVAPRGAVGESG 213
             P  P+GA G +G
Sbjct:    38 PAGPQGAPGPAG 49


to_Entrezto_Relatedto_Related >gi|2119160|pir||I50629  collagen - chicken (fragment) >gi|63308|emb|CAA23695.1|
            (V00401) collagen [Gallus gallus]
            Length = 473

Frame  1 hits (HSPs):    ________                                         
                        __________________________________________________
Database sequence:     |               |               |               |  | 473
                       0             150             300             450

  Plus Strand HSPs:

 Score = 75 (26.4 bits), Expect = 0.93, P = 0.61
 Identities = 23/71 (32%), Positives = 28/71 (39%), Frame = +1

Query:    31 AXEPGKXHHENRSPSASEL*S--GXASSVADRAQVPWKGAPARVRAPSCPDPVAPRGAVG 204
             A  PG+   E  +P A       G A    DR +    G P    AP  P PV P G  G
Sbjct:    15 AGPPGEAGREG-APGAEGAPGRDGAAGPKGDRGETGPAGPPGAPGAPGAPGPVGPAGKNG 73

Query:   205 ESGCLGMQPQSG 240
             + G  G    +G
Sbjct:    74 DRGETGPAGPAG 85


to_Entrezto_Relatedto_Related >gi|7508879|pir||T28770  hypothetical protein W03D2.1 - Caenorhabditis elegans
            >gi|1947160|gb|AAC48255.1| (AF000298) weak similarity to collagens;
            glycine- and proline-rich [Caenorhabditis elegans]
            Length = 539

Frame  3 hits (HSPs):          __    __                                   
Frame  1 hits (HSPs):                               _____                 
                        __________________________________________________
Database sequence:     |             |             |             |        | 539
                       0           150           300           450

  Plus Strand HSPs:

 Score = 68 (23.9 bits), Expect = 0.99, Sum P(2) = 0.63
 Identities = 21/50 (42%), Positives = 25/50 (50%), Frame = +1

Query:   130 PWKGAPARVRAPSCPDPVAPRGAVGESGCLGMQPQSGGKFRPRLNTGERP 279
             P  G+P   RA S P P  PRG+   +G L   PQ+GG   P   TG  P
Sbjct:   305 PAGGSPPPPRAGSPPPPPPPRGSP-PTGSLP-PPQAGGS-PPPAGTGSPP 351

 Score = 33 (11.6 bits), Expect = 0.99, Sum P(2) = 0.63
 Identities = 6/9 (66%), Positives = 6/9 (66%), Frame = +3

Query:    30 GXRTGKXPP 56
             G RTG  PP
Sbjct:    86 GGRTGSPPP 94

 Score = 33 (11.6 bits), Expect = 0.99, Sum P(2) = 0.63
 Identities = 6/9 (66%), Positives = 6/9 (66%), Frame = +3

Query:    30 GXRTGKXPP 56
             G RTG  PP
Sbjct:   144 GGRTGSPPP 152


to_Entrezto_Related >gi|11437170  ref|XP_003597.1| hypothetical protein FLJ11230 [Homo sapiens]
            Length = 248

Frame -1 hits (HSPs):          __________                                 
                        __________________________________________________
Database sequence:     |         |         |          |         |         | 248
                       0        50       100        150       200

  Minus Strand HSPs:

 Score = 71 (25.0 bits), Expect = 1.0, P = 0.63
 Identities = 14/48 (29%), Positives = 22/48 (45%), Frame = -1

Query:   236 DWGCIPKQPDSPTAPRGATGSGHDGALTLAGAPFQGTWARSATEDASP 93
             D G +P+       P+GA  SG  G ++ + +   G W     ED +P
Sbjct:    38 DGGALPEGLAEEAEPQGAAWSGDSGTVSQSHSSASGPWEDEGAEDGAP 85


to_Entrezto_Relatedto_Related >gi|1340174|emb|CAA25821.1|  (X01655) type III procollagen (aa 892-1023) [Homo
            sapiens]
            Length = 132

Frame  1 hits (HSPs):                       _____________________         
                        __________________________________________________
Database sequence:     |                  |                  |            | 132
                       0                 50                100

  Plus Strand HSPs:

 Score = 68 (23.9 bits), Expect = 1.1, P = 0.66
 Identities = 23/55 (41%), Positives = 24/55 (43%), Frame = +1

Query:   130 PWKGAPARVR-APSCPDPVAPRGAVGESGCLGMQPQSG-----GKFRPRLNTGER 276
             P    PA  + A   P P  PRG VG SG  G    SG     G   PR N GER
Sbjct:    54 PGSPGPAGQQGAIGSPGPAGPRGPVGPSGPPGKDGTSGHPGPIGPPGPRGNRGER 108


to_Entrezto_Related >gi|71415|pir||CGRT2S  collagen alpha 2(I) chain - rat (tentative sequence)
            (fragments)
            Length = 184

Frame  3 hits (HSPs):          _______                                    
Frame  1 hits (HSPs):                                   _______           
Annotated Domains:      __                                                
                        __________________________________________________
Database sequence:     |             |            |             |         | 184
                       0            50          100           150
__________________

Annotated Domains:
   Entrez               modified site: blocked amino end (Glx) ( 1
   Entrez               modified site: allysine (Lys)            5
__________________


  Plus Strand HSPs:

 Score = 54 (19.0 bits), Expect = 1.2, Sum P(2) = 0.70
 Identities = 10/23 (43%), Positives = 12/23 (52%), Frame = +3

Query:   111 GGPGPSPLEGGASQGESPVVPGP 179
             G PGP   +G A +   P  PGP
Sbjct:    27 GAPGPQGFQGPAGEPGEPGQPGP 49

 Score = 54 (19.0 bits), Expect = 1.2, Sum P(2) = 0.70
 Identities = 9/23 (39%), Positives = 14/23 (60%), Frame = +1

Query:   172 PDPVAPRGAVGESGCLGMQPQSG 240
             P P+ P G  G++G +G  P +G
Sbjct:   120 PGPIGPAGPRGZAGAIGF-PMTG 141


to_Entrezto_Relatedto_Related >gi|7494559|pir||T28887  collagen dpy-10 - Caenorhabditis elegans
            >gi|1213522|gb|AAA91236.1| (U50191) C. elegans collagen dpy-10 gene
            (Levy, A.D., Yang, J. and Kramer, J.M. Mol. Biol Cell 4, 803-17,
            1993) [Caenorhabditis elegans]
            Length = 284

Frame  1 hits (HSPs):                   __________                        
                        __________________________________________________
Database sequence:     |        |        |        |        |       |      | 284
                       0       50      100      150      200     250

  Plus Strand HSPs:

 Score = 71 (25.0 bits), Expect = 1.2, P = 0.71
 Identities = 18/56 (32%), Positives = 23/56 (41%), Frame = +1

Query:   139 GAPARVRAPSC--PDPVAPRGAVGESGCLGMQPQSGGKFRPRLNTGERPIANKYRE 300
             GAP     P C  P P  PRG+ G  G  G+   +G    P     +    N+ RE
Sbjct:    93 GAPLETECPGCCIPGPPGPRGSSGTPGKPGLPGNAGKPGMPGTTPNQTCPLNQVRE 148


to_Entrezto_Related >gi|4502951  ref|NP_000081.1| collagen, type III, alpha 1; Collagen III, alpha-1
            polypeptide [Homo sapiens] >gi|115306|sp|P02461|CA13_HUMAN COLLAGEN
            ALPHA 1(III) CHAIN PRECURSOR >gi|30058|emb|CAA32583.1| (X14420)
            prepro-alpha-1 type 3 collagen [Homo sapiens]
            Length = 1466

Frame  3 hits (HSPs):                                  ___                
Frame  2 hits (HSPs):                       __                            
Frame  1 hits (HSPs):                                         __          
                        __________________________________________________
Database sequence:     |                 |                |               | 1466
                       0               500             1000

  Plus Strand HSPs:

 Score = 68 (23.9 bits), Expect = 1.3, Sum P(2) = 0.73
 Identities = 23/55 (41%), Positives = 24/55 (43%), Frame = +1

Query:   130 PWKGAPARVR-APSCPDPVAPRGAVGESGCLGMQPQSG-----GKFRPRLNTGER 276
             P    PA  + A   P P  PRG VG SG  G    SG     G   PR N GER
Sbjct:  1118 PGSPGPAGQQGAIGSPGPAGPRGPVGPSGPPGKDGTSGHPGPIGPPGPRGNRGER 1172

 Score = 41 (14.4 bits), Expect = 1.3, Sum P(2) = 0.73
 Identities = 13/43 (30%), Positives = 17/43 (39%), Frame = +3

Query:    30 GXRTGKXPP*KSVALGVRIVVWRXVLSGGPG-PSPLEGGASQG 155
             G    + PP     LG+  +     L+G PG P P      QG
Sbjct:   933 GSPGAQGPPGAPGPLGIAGITGARGLAGPPGMPGPRGSPGPQG 975

 Score = 36 (12.7 bits), Expect = 8.9, Sum P(2) = 1.0
 Identities = 8/19 (42%), Positives = 9/19 (47%), Frame = +2

Query:    80 PNCSLEXRPQWRTGPKSPG 136
             P  + E  PQ   GP  PG
Sbjct:   608 PGKNGETGPQGPPGPTGPG 626


to_Entrezto_Related >gi|1070603|pir||CGHU7L  collagen alpha 1(III) chain precursor - human
            Length = 1466

Frame  3 hits (HSPs):                                  ___                
Frame  2 hits (HSPs):                       __                            
Frame  1 hits (HSPs):                                         __          
Annotated Domains:      __________________________________________________
                        __________________________________________________
Database sequence:     |                 |                |               | 1466
                       0               500             1000
__________________

Annotated Domains:
   Entrez               domain: signal sequence                  1..23
   Entrez               domain: amino-terminal propeptide        24..153
   Entrez               domain: von Willebrand factor type C rep 31..91
   Entrez               region: amino-terminal nonhelical telope 154..167
   Entrez               region: helical                          168..1196
   Entrez               region: cell attachment (R-G-D) motif    1091..1093
   Entrez               region: carboxyl-terminal nonhelical tel 1197..1221
   Entrez               domain: carboxyl-terminal propeptide     1222..1466
   Entrez               domain: fibrillar collagen carboxyl-term 1238..1466
   Entrez               modified site: pyrrolidone carboxylic ac 24
   Entrez               cleavage site: Pro-Gln (procollagen N-en 153..154
   Entrez               modified site: pyrrolidone carboxylic ac 154
   Entrez               modified site: allysine (Lys)            161
   Entrez               modified site: allysine (Lys)            1212
   Entrez               modified site: 5-hydroxylysine (Lys)     263
   Entrez               modified site: 5-hydroxylysine (Lys)     284
   Entrez               modified site: 5-hydroxylysine (Lys)     860
   Entrez               modified site: 5-hydroxylysine (Lys)     977
   Entrez               modified site: 5-hydroxylysine (Lys)     1106
   Entrez               binding site: carbohydrate (Lys) (covale 263
   Entrez               modified site: 5-hydroxylysine (Lys) (pa 584
   Entrez               modified site: 5-hydroxylysine (Lys) (pa 1094
   Entrez               cleavage site: Gly-Ile (collagenase)     948..949
   Entrez               binding site: carbohydrate (Lys) (covale 1106
   Entrez               modified site: 3-hydroxyproline (Pro)    1162
   Entrez               cleavage site: Gly-Asp (procollagen C-en 1221..1222
   Entrez               binding site: carbohydrate (Asn) (covale 1367
   PROSITE              GRAM_POS_ANCHORING: Gram-positive cocci  643..648
__________________


  Plus Strand HSPs:

 Score = 68 (23.9 bits), Expect = 1.3, Sum P(2) = 0.73
 Identities = 23/55 (41%), Positives = 24/55 (43%), Frame = +1

Query:   130 PWKGAPARVR-APSCPDPVAPRGAVGESGCLGMQPQSG-----GKFRPRLNTGER 276
             P    PA  + A   P P  PRG VG SG  G    SG     G   PR N GER
Sbjct:  1118 PGSPGPAGQQGAIGSPGPAGPRGPVGPSGPPGKDGTSGHPGPIGPPGPRGNRGER 1172

 Score = 41 (14.4 bits), Expect = 1.3, Sum P(2) = 0.73
 Identities = 13/43 (30%), Positives = 17/43 (39%), Frame = +3

Query:    30 GXRTGKXPP*KSVALGVRIVVWRXVLSGGPG-PSPLEGGASQG 155
             G    + PP     LG+  +     L+G PG P P      QG
Sbjct:   933 GSPGAQGPPGAPGPLGIAGITGARGLAGPPGMPGPRGSPGPQG 975

 Score = 36 (12.7 bits), Expect = 8.9, Sum P(2) = 1.0
 Identities = 8/19 (42%), Positives = 9/19 (47%), Frame = +2

Query:    80 PNCSLEXRPQWRTGPKSPG 136
             P  + E  PQ   GP  PG
Sbjct:   608 PGKNGETGPQGPPGPTGPG 626


to_Entrezto_Relatedto_Related >gi|3171998|emb|CAA06510.1|  (AJ005395) collagen alpha 1 (III) [Rattus
            norvegicus]
            Length = 564

Frame  1 hits (HSPs):            _______                                  
                        __________________________________________________
Database sequence:     |             |            |            |          | 564
                       0           150          300          450

  Plus Strand HSPs:

 Score = 74 (26.0 bits), Expect = 1.5, P = 0.77
 Identities = 22/71 (30%), Positives = 26/71 (36%), Frame = +1

Query:    28 TAXEPGKXHHENRSPSASEL*SGXASSVADRAQVPWKGAPARVRAPSCPDPVAPRGAVGE 207
             TA EPG+  +            G      DR +    GAP     P  P PV P G  G+
Sbjct:   106 TAGEPGRDGNPGSDGQPGR--DGSPGGKGDRGENGSPGAPGAPGHPGPPGPVGPSGKNGD 163

Query:   208 SGCLGMQPQSG 240
              G  G    SG
Sbjct:   164 RGETGPAGPSG 174


to_Entrezto_Relatedto_Related >gi|1814029|gb|AAB41793.1|  (U84501) cuticle collagen [Caenorhabditis briggsae]
            Length = 316

Frame  3 hits (HSPs):                                   ___               
Frame  1 hits (HSPs):                                       _______       
                        __________________________________________________
Database sequence:     |       |       |       |       |       |       |  | 316
                       0      50     100     150     200     250     300

  Plus Strand HSPs:

 Score = 66 (23.2 bits), Expect = 1.5, Sum P(2) = 0.79
 Identities = 16/39 (41%), Positives = 19/39 (48%), Frame = +1

Query:   139 GAPARV-RAPSCPDPVAPRGAVGESGCLGMQPQSGGKFRP 255
             GAP +V   P  P P  P G  G +G  G QP   G  +P
Sbjct:   230 GAPGQVVDVPGTPGPAGPPGPPGPAGAPG-QPGQAGSSQP 268

 Score = 35 (12.3 bits), Expect = 1.5, Sum P(2) = 0.79
 Identities = 8/16 (50%), Positives = 9/16 (56%), Frame = +3

Query:   105 LSGGPGPSPLEGGASQ 152
             L G PGP+   G A Q
Sbjct:   204 LPGPPGPAGPPGPAGQ 219


to_Entrezto_Related >gi|258774|gb|AAB23914.1|  type II collagen alpha 1 chain, COL2A1 [human,
            Peptide Partial Mutant, 346 aa]
            Length = 346

Frame  1 hits (HSPs):                                           __________
                        __________________________________________________
Database sequence:     |                     |                     |      | 346
                       0                   150                   300

  Plus Strand HSPs:

 Score = 71 (25.0 bits), Expect = 1.7, P = 0.81
 Identities = 20/66 (30%), Positives = 23/66 (34%), Frame = +1

Query:    31 AXEPGKXHHENRSPSASEL*SGXASSVADRAQVPWKGAPARVRAPSCPDPVAPRGAVGES 210
             A EPG+               G A    DR +    GAP     P  P P  P G  G+ 
Sbjct:   280 AGEPGRQGSPGADGPPGR--DGAAEVKGDRGETGAVGAPGTPGPPGSPGPAGPTGKQGDR 337

Query:   211 GCLGMQ 228
             G  G Q
Sbjct:   338 GEAGAQ 343


to_Entrezto_Relatedto_Related >gi|2388676|gb|AAB80719.1|  (AF015539) precollagen P [Mytilus edulis]
            Length = 902

Frame  2 hits (HSPs):                 _____                               
Frame  1 hits (HSPs):                          ___                        
                        __________________________________________________
Database sequence:     |        |       |       |        |       |       || 902
                       0      150     300     450      600     750     900

  Plus Strand HSPs:

 Score = 64 (22.5 bits), Expect = 1.7, Sum P(2) = 0.81
 Identities = 17/44 (38%), Positives = 19/44 (43%), Frame = +1

Query:   139 GAPARVRAPSCPDPVAPRGAVGESGCLGM---QPQSGGKFRPRL 261
             G P    AP  P    PRG +G SG  G    Q   GG+  P L
Sbjct:   420 GGPGDKGAPGTPGGTGPRGPIGPSGPSGAPGDQGPQGGRGTPGL 463

 Score = 51 (18.0 bits), Expect = 1.7, Sum P(2) = 0.81
 Identities = 15/41 (36%), Positives = 16/41 (39%), Frame = +2

Query:    32 RXNREXPTMKIGRPRRPNCSLEXRPQWRTGPKSPGRGRQPG 154
             R     P  + G P RP  S   RP     P  PGR   PG
Sbjct:   262 RLGNPGPPGQPGNPGRPGSS--GRPGGSGQPGGPGRPGTPG 300

 Score = 45 (15.8 bits), Expect = 6.7, Sum P(2) = 1.0
 Identities = 12/31 (38%), Positives = 14/31 (45%), Frame = +2

Query:    65 GRPRRP-NCSLEXRPQWRTGPKSPGRGRQPG 154
             G P +P N     +P     P  PG G QPG
Sbjct:   297 GTPGKPGNRGQPGQPGGPGQPGHPGAGGQPG 327


WARNING:  HSPs involving 98 database sequences were not reported due to the
          limiting value of parameter B = 50.


Parameters:
  filter=none
  matrix=BLOSUM62
  V=50
  B=50
  E=10
  gi
  H=1
  sort_by_pvalue
  echofilter

  ctxfactor=5.93

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   Std.    0   BLOSUM62                                 0.318   0.135   0.401  
   +3      0   BLOSUM62        0.318   0.135   0.401    0.325   0.144   0.472  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   +2      0   BLOSUM62        0.318   0.135   0.401    0.350   0.154   0.561  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   +1      0   BLOSUM62        0.316   0.134   0.417    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -1      0   BLOSUM62        0.315   0.135   0.448    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -2      0   BLOSUM62        0.318   0.135   0.401    0.321   0.142   0.448  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
   -3      0   BLOSUM62        0.318   0.135   0.401    0.350   0.157   0.586  
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E    S W   T  X   E2     S2
   +3      0      100        95       10.  66 3  12 22  0.099   32
                                                    28  0.12    33
   +2      0      100        96       10.  66 3  12 22  0.10    32
                                                    28  0.095   34
   +1      0      100        95       10.  66 3  12 22  0.10    32
                                                    28  0.12    33
   -1      0      100        96       10.  66 3  13 22  0.11    32
                                                    28  0.095   34
   -2      0      100        96       10.  66 3  12 22  0.10    32
                                                    28  0.095   34
   -3      0      100        97       10.  66 3  12 22  0.10    32
                                                    28  0.097   34


Statistics:

  Database:  /usr/local/dot5/sl_home/beauty/seqdb/blast/nr
    Title:  nr
    Release date:  unknown
    Posted date:  4:06 PM CST Feb 28, 2001
    Format:  BLAST
  # of letters in database:  197,782,623
  # of sequences in database:  625,274
  # of database sequences satisfying E:  148
  No. of states in DFA:  572 (56 KB)
  Total size of DFA:  130 KB (192 KB)
  Time to generate neighborhood:  0.01u 0.00s 0.01t  Elapsed: 00:00:00
  No. of threads or processors used:  6
  Search cpu time:  124.11u 0.90s 125.01t  Elapsed: 00:00:36
  Total cpu time:  124.18u 0.96s 125.14t  Elapsed: 00:00:37
  Start:  Wed Jan 16 12:28:01 2002   End:  Wed Jan 16 12:28:38 2002

WARNINGS ISSUED:  2

Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000