WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.
BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.
BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract
Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract
RepeatMasker repeats found in sequence:No Repeats Found.Reference: Gish, Warren (1994-1997). unpublished. Gish, Warren and David J. States (1993). Identification of protein coding regions by database similarity search. Nat. Genet. 3:266-72.Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.
Query= 'D03E02.seq' (302 letters)
Translating both strands of query sequence in all 6 reading framesDatabase: nr 625,274 sequences; 197,782,623 total letters.Observed Numbers of Database Sequences Satisfying Various EXPECTation Thresholds (E parameter values) Histogram units: = 43 Sequences : less than 43 sequences EXPECTation Threshold (E parameter) | V Observed Counts--> 10000 10443 2281 |===================================================== 6310 8162 1524 |=================================== 3980 6638 2567 |=========================================================== 2510 4071 1931 |============================================ 1580 2140 570 |============= 1000 1570 430 |========== 631 1140 277 |====== 398 863 173 |==== 251 690 161 |=== 158 529 118 |== 100 411 114 |== 63.1 297 58 |= 39.8 239 36 |: 25.1 203 31 |: 15.8 172 24 |: >>>>>>>>>>>>>>>>>>>>> Expect = 10.0, Observed = 148 <<<<<<<<<<<<<<<<< 10.0 148 28 |: 6.31 120 24 |: 3.98 96 31 |: 2.51 65 17 |: 1.58 48 8 |: 1.00 40 9 |: 0.63 31 8 |: 0.40 23 5 |: 0.25 18 7 |: 0.16 11 4 |: 0.10 7 1 |: 0.063 6 1 |: 0.040 5 0 | 0.025 5 0 | 0.016 5 1 |: 0.010 4 1 |: 0.0063 3 0 | 0.0040 3 0 | 0.0025 3 2 |: Smallest Sum Reading High Probability Sequences producing High-scoring Segment Pairs: Frame Score P(N) N gi|461707|sp|P34687|CC34_CAEELCUTICLE COLLAGEN 34 >gi... +1 71 0.00086 2 gi|115399|sp|P16252|CAC2_HAECOCUTICLE COLLAGEN 2C >gi... +1 70 0.0024 2 gi|321007|pir||B44984collagen - nematode (Haemonchus ... +1 70 0.0024 2 gi|7500590|pir||T29956hypothetical protein F36A4.10 -... +1 71 0.0090 2 gi|7331814|gb|AAF60502.1|(AC006743) contains similari... +1 89 0.014 1 gi|9625710ref|NP_039959.1| UL25 FAMILY [human herpesv... -1 71 0.039 2 gi|2978419|gb|AAC06113.1|(M63709) alpha-1 type II col... +1 75 0.080 1 gi|3242649|dbj|BAA29028.1|(AB015440) alpha 1 type I c... -1 79 0.099 2 gi|115267|sp||CA11_BOVIN_2[Segment 2 of 2] COLLAGEN A... -1 71 0.11 2 gi|4836662|gb|AAD30510.1|AF129925_4(AF129925) CsoS2 [... -1 72 0.13 2 gi|227093|prf||1614239Acollagen alpha1(V) 66-267 [Hom... +1 77 0.14 1 gi|263810|gb|AAB24972.1|collagen alpha chain [Riftia ... -1 70 0.15 2 gi|399170|sp|P30754|CAFF_RIFPAFIBRIL-FORMING COLLAGEN... -1 70 0.15 2 gi|2119156|pir||S28774collagen alpha chain - tube wor... -1 70 0.15 2 gi|7499772|pir||T21314hypothetical protein F23H12.4 -... +1 79 0.17 1 gi|476420|pir||CGBO1Scollagen alpha 1(I) chain - bovi... -1 71 0.17 2 gi|7441988|pir||T16984transcription factor homolog BT... +3 74 0.19 1 gi|7516580|pir||C72637hypothetical protein APE1554 - ... -2 71 0.20 1 gi|71405|pir||CGCH1Scollagen alpha 1(I) chain - chick... -1 70 0.24 2 gi|115397|sp|P08124|CC01_CAEELCUTICLE COLLAGEN 1 >gi|... +1 77 0.25 1 gi|180882|gb|AAA52053.1|(M21353) alpha-2 type I colla... +3 42 0.27 2 gi|5732934|gb|AAD49346.1|(AF169346) pro-alpha-1 type ... +1 75 0.27 1 gi|115411|sp|P17657|CCDC_CAEELCUTICLE COLLAGEN DPY-13... +1 67 0.30 2 gi|7504118|pir||T22607hypothetical protein F54B11.1 -... +1 65 0.34 2 gi|83714|pir||JN0254qutH protein - Emericella nidulans +3 77 0.34 1 gi|192264|gb|AAA37334.1|(M17491) procollagen type I a... +1 77 0.36 1 gi|3288493|emb|CAA75878.1|(Y15918) COL1A1 and PDGFB f... -1 71 0.40 1 gi|10440430|dbj|BAB15748.1|(AK024458) FLJ00050 protei... -2 74 0.42 1 gi|115268|sp|P02457|CA11_CHICKCOLLAGEN ALPHA 1(I) CHA... -1 70 0.42 2 gi|2136732|pir||I45876collagen alpha 1(II) chain - bo... +1 67 0.45 1 gi|159960|gb|AAA29439.1|(M24558) collagen-like protei... +1 74 0.45 1 gi|7510834|pir||T28999hypothetical protein ZC513.8 - ... +1 74 0.48 1 gi|930045|emb|CAA33387.1|(X15332) alpha-1 (III) colla... +1 68 0.50 2 gi|553198|gb|AAA51816.1|(M31731) bcl-3 protein [Homo ... +1 66 0.53 1 gi|5921192|sp|P02467|CA21_CHICKCOLLAGEN ALPHA 2(I) CH... +1 69 0.53 2 gi|192262|gb|AAA37333.1|(M14423) pro-alpha-1 type I c... +1 77 0.54 1 gi|8922952ref|NP_060836.1| hypothetical protein FLJ11... -1 71 0.55 1 gi|2143752|pir||I60384gene T1 protein - rat (fragment... +2 41 0.57 2 gi|2119160|pir||I50629collagen - chicken (fragment) >... +1 75 0.61 1 gi|7508879|pir||T28770hypothetical protein W03D2.1 - ... +1 68 0.63 2 gi|11437170ref|XP_003597.1| hypothetical protein FLJ1... -1 71 0.63 1 gi|1340174|emb|CAA25821.1|(X01655) type III procollag... +1 68 0.66 1 gi|71415|pir||CGRT2Scollagen alpha 2(I) chain - rat (... +3 54 0.70 2 gi|7494559|pir||T28887collagen dpy-10 - Caenorhabditi... +1 71 0.71 1 gi|4502951ref|NP_000081.1| collagen, type III, alpha ... +1 68 0.73 2 gi|1070603|pir||CGHU7Lcollagen alpha 1(III) chain pre... +1 68 0.73 2 gi|3171998|emb|CAA06510.1|(AJ005395) collagen alpha 1... +1 74 0.77 1 gi|1814029|gb|AAB41793.1|(U84501) cuticle collagen [C... +1 66 0.79 2 gi|258774|gb|AAB23914.1|type II collagen alpha 1 chai... +1 71 0.81 1 gi|2388676|gb|AAB80719.1|(AF015539) precollagen P [My... +1 64 0.81 2
Use the and icons to retrieve links to Entrez:
WARNING: Descriptions of 98 database sequences were not reported due to the limiting value of parameter V = 50. >gi|461707|sp|P34687|CC34_CAEEL CUTICLE COLLAGEN 34 >gi|345339|pir||JC1448 collagen col-34 - Caenorhabditis elegans >gi|156250|gb|AAA27985.1| (M80650) alpha-collagen [Caenorhabditis elegans] Length = 298 Frame 2 hits (HSPs): ___________ Frame 1 hits (HSPs): __________ Annotated Domains: __________________________________________________ __________________________________________________ Database sequence: | | | | | | | 298 0 50 100 150 200 250 __________________ Annotated Domains: DOMO DM00019: FIBRILLARCOLLAGENCARBOXYL-TERMI 106..271 Entrez Domain: TRIPLE-HELICAL REGION. 103..132 Entrez Domain: TRIPLE-HELICAL REGION. 151..177 Entrez Domain: TRIPLE-HELICAL REGION. 181..198 Entrez Domain: TRIPLE-HELICAL REGION. 215..277 PFAM Collagen: Collagen triple helix repeat ( 143..201 PFAM Collagen: Collagen triple helix repeat ( 215..274 PRODOM PD004226: 6..32 PRODOM PD000926: 34..100 PRODOM PD000540: H1(18) O76786(11) TONB(10) 102..197 PRODOM PD026369: CC34(1) Q20087(1) 238..258 PRODOM PD002391: 278..297 __________________ Plus Strand HSPs: Score = 71 (25.0 bits), Expect = 0.00086, Sum P(2) = 0.00086 Identities = 19/52 (36%), Positives = 23/52 (44%), Frame = +1 Query: 139 GAPARVRAPSCPDPVAPRGAVGESGCLGMQPQSGGKFRPRLNTGERPIANKY 294 G P +P P P P G G+ G G P + G P + GER I KY Sbjct: 233 GQPGADGSPGQPGPKGPNGPDGQPGADG-NPGAPGPAGPPGSPGERGICPKY 283 Score = 48 (16.9 bits), Expect = 0.00086, Sum P(2) = 0.00086 Identities = 11/31 (35%), Positives = 14/31 (45%), Frame = +2 Query: 65 GRPRRPNCSLEXRPQWRTGPKSP-GRGRQPG 154 GRP + C P W+ P+ P G PG Sbjct: 130 GRPPQQPCEPITPPPWKPCPQGPPGPPGPPG 160 Score = 38 (13.4 bits), Expect = 0.0090, Sum P(2) = 0.0090 Identities = 9/24 (37%), Positives = 11/24 (45%), Frame = +2 Query: 83 NCSLEXRPQWRTGPKSPGRGRQPG 154 +C L P P PGR +PG Sbjct: 98 SCCLPGPPGPAGTPGKPGRPGKPG 121 >gi|115399|sp|P16252|CAC2_HAECO CUTICLE COLLAGEN 2C >gi|159167|gb|AAA29172.1| (J04670) collagen 2c [Haemonchus contortus] Length = 210 Frame 2 hits (HSPs): ____________ Frame 1 hits (HSPs): _____________ Annotated Domains: _______________ _______________ __________________________________________________ Database sequence: | | | | | | 210 0 50 100 150 200 __________________ Annotated Domains: PFAM Collagen: Collagen triple helix repeat ( 62..121 PFAM Collagen: Collagen triple helix repeat ( 130..189 __________________ Plus Strand HSPs: Score = 70 (24.6 bits), Expect = 0.0024, Sum P(2) = 0.0024 Identities = 19/52 (36%), Positives = 21/52 (40%), Frame = +1 Query: 139 GAPARVRAPSCPDPVAPRGAVGESGCLGMQPQSGGKFRPRLNTGERPIANKY 294 GAP AP P PRG G G G G +P GER + KY Sbjct: 145 GAPGHPGAPGAPGEKGPRGQDGHPGAPGNAGHPGQPGQPG-PPGERGVCPKY 195 Score = 40 (14.1 bits), Expect = 0.0024, Sum P(2) = 0.0024 Identities = 12/39 (30%), Positives = 14/39 (35%), Frame = +2 Query: 50 PTMKIGRPRRPNCSLEXRPQWRTGPKSP----GRGRQPG 154 P + G P P P GPK P G+ PG Sbjct: 72 PPGEPGTPGNPGAPGNDAPPGPPGPKGPPGPPGKAGAPG 110 Score = 39 (13.7 bits), Expect = 0.0031, Sum P(2) = 0.0031 Identities = 9/18 (50%), Positives = 10/18 (55%), Frame = +2 Query: 104 PQWRTGPKSP-GRGRQPG 154 PQ R GP P G +PG Sbjct: 60 PQGRPGPPGPIGPPGEPG 77 >gi|321007|pir||B44984 collagen - nematode (Haemonchus contortus) (fragment) Length = 210 Frame 2 hits (HSPs): ____________ Frame 1 hits (HSPs): _____________ __________________________________________________ Database sequence: | | | | | | 210 0 50 100 150 200 Plus Strand HSPs: Score = 70 (24.6 bits), Expect = 0.0024, Sum P(2) = 0.0024 Identities = 19/52 (36%), Positives = 21/52 (40%), Frame = +1 Query: 139 GAPARVRAPSCPDPVAPRGAVGESGCLGMQPQSGGKFRPRLNTGERPIANKY 294 GAP AP P PRG G G G G +P GER + KY Sbjct: 145 GAPGHPGAPGAPGEKGPRGQDGHPGAPGNAGHPGQPGQPG-PPGERGVCPKY 195 Score = 40 (14.1 bits), Expect = 0.0024, Sum P(2) = 0.0024 Identities = 12/39 (30%), Positives = 14/39 (35%), Frame = +2 Query: 50 PTMKIGRPRRPNCSLEXRPQWRTGPKSP----GRGRQPG 154 P + G P P P GPK P G+ PG Sbjct: 72 PPGEPGTPGNPGAPGNDAPPGPPGPKGPPGPPGKAGAPG 110 Score = 39 (13.7 bits), Expect = 0.0031, Sum P(2) = 0.0031 Identities = 9/18 (50%), Positives = 10/18 (55%), Frame = +2 Query: 104 PQWRTGPKSP-GRGRQPG 154 PQ R GP P G +PG Sbjct: 60 PQGRPGPPGPIGPPGEPG 77 >gi|7500590|pir||T29956 hypothetical protein F36A4.10 - Caenorhabditis elegans >gi|1255802|gb|AAA96155.1| (U53333) coded for by C. elegans cDNA yk120g12.5; Similar to cuticular collagen [Caenorhabditis elegans] Length = 299 Frame 2 hits (HSPs): ___________ Frame 1 hits (HSPs): __________ __________________________________________________ Database sequence: | | | | | | | 299 0 50 100 150 200 250 Plus Strand HSPs: Score = 71 (25.0 bits), Expect = 0.0091, Sum P(2) = 0.0090 Identities = 19/52 (36%), Positives = 23/52 (44%), Frame = +1 Query: 139 GAPARVRAPSCPDPVAPRGAVGESGCLGMQPQSGGKFRPRLNTGERPIANKY 294 G P +P P P P G G+ G G P + G P + GER I KY Sbjct: 234 GQPGADGSPGQPGPKGPNGPDGQPGADG-NPGAPGPAGPPGSPGERGICPKY 284 Score = 38 (13.4 bits), Expect = 0.0091, Sum P(2) = 0.0090 Identities = 9/24 (37%), Positives = 11/24 (45%), Frame = +2 Query: 83 NCSLEXRPQWRTGPKSPGRGRQPG 154 +C L P P PGR +PG Sbjct: 98 SCCLPGPPGPAGTPGKPGRPGKPG 121 Score = 35 (12.3 bits), Expect = 0.018, Sum P(2) = 0.018 Identities = 10/31 (32%), Positives = 13/31 (41%), Frame = +2 Query: 65 GRPRRPNCSLEXRPQWRTGPKSP-GRGRQPG 154 GRP + C P + P+ P G PG Sbjct: 130 GRPPQQPCEPITPPPCKPCPQGPPGPPGPPG 160 >gi|7331814|gb|AAF60502.1| (AC006743) contains similarity to Pfam family PF01391 (Collagen triple helix repeats), score=66, E=8.2e-16, N=2 [Caenorhabditis elegans] Length = 291 Frame 2 hits (HSPs): _______ ______ Frame 1 hits (HSPs): _________ __________ __________________________________________________ Database sequence: | | | | | | | 291 0 50 100 150 200 250 Plus Strand HSPs: Score = 89 (31.3 bits), Expect = 0.014, P = 0.014 Identities = 19/47 (40%), Positives = 23/47 (48%), Frame = +1 Query: 139 GAPARVRAPSCPDPVAPRGAVGESGCLGMQPQSGGKFRPRLNTGERP 279 G P R +P P PV P GA G+SG G + G RP +T P Sbjct: 161 GNPGRPGSPGTPGPVGPNGASGDSGAPGNDGEKGEPGRPAQSTPSTP 207 Score = 65 (22.9 bits), Expect = 0.59, Sum P(2) = 0.45 Identities = 23/52 (44%), Positives = 26/52 (50%), Frame = +1 Query: 139 GAPARVRAPSC---PDPVAPRGAVGESGCLGMQPQSGGKFRPRLNTGERPIANKY 294 GAP R P P P P G VGE+G G +P G P+ GER I KY Sbjct: 226 GAPGRDGQPGQSGPPGPPGPPGNVGEAGPPG-KPGQPGLPGPQ---GERGICPKY 276 Score = 42 (14.8 bits), Expect = 0.59, Sum P(2) = 0.45 Identities = 15/35 (42%), Positives = 17/35 (48%), Frame = +2 Query: 59 KIGRPRRPNCSLEXRPQWRTG-PKSPGRGRQPG*EP 163 K G P RP L R + G P +PGR PG P Sbjct: 94 KPGHPGRPG--LPGR-NGKPGVPGAPGRPGTPGRPP 126 Score = 34 (12.0 bits), Expect = 3.7, Sum P(2) = 0.98 Identities = 10/30 (33%), Positives = 10/30 (33%), Frame = +2 Query: 65 GRPRRPNCSLEXRPQWRTGPKSPGRGRQPG 154 G P RP S P P G PG Sbjct: 194 GEPGRPAQSTPSTPGEPGNPGDAGATGAPG 223 >gi|9625710 ref|NP_039959.1| UL25 FAMILY [human herpesvirus 5] >gi|136862|sp|P16761|UL25_HCMVA HYPOTHETICAL PROTEIN UL25 >gi|73666|pir||QQBET2 UL25 protein - human cytomegalovirus (strain AD169) >gi|59630|emb|CAA35424.1| (X17403) UL25 FAMILY [human herpesvirus 5] Length = 656 Frame -1 hits (HSPs): _____ Frame -2 hits (HSPs): __ __________________________________________________ Database sequence: | | | | | | 656 0 150 300 450 600 Minus Strand HSPs: Score = 71 (25.0 bits), Expect = 0.040, Sum P(2) = 0.039 Identities = 18/58 (31%), Positives = 27/58 (46%), Frame = -1 Query: 224 IPKQPDSPTAPRGATGSGHDGALTLAGAPFQGTWARS-ATEDASPDYNSDAEGDRFSW 54 + + P P+ P G G D + G+ + T S +T +P S AEGD FS+ Sbjct: 120 VSRPPSVPSLPENGAGGGGDDNSSSGGSSSRTTSNSSRSTSPVAPGEPSAAEGDEFSF 177 Score = 40 (14.1 bits), Expect = 0.040, Sum P(2) = 0.039 Identities = 8/12 (66%), Positives = 9/12 (75%), Frame = -2 Query: 52 GXFPVRSPLLRE 17 G F VR PLLR+ Sbjct: 534 GEFMVRDPLLRD 545 >gi|2978419|gb|AAC06113.1| (M63709) alpha-1 type II collagen [Mus musculus] Length = 115 Frame 1 hits (HSPs): __________________________________ __________________________________________________ Database sequence: | | | | 115 0 50 100 Plus Strand HSPs: Score = 75 (26.4 bits), Expect = 0.084, P = 0.080 Identities = 26/85 (30%), Positives = 29/85 (34%), Frame = +1 Query: 31 AXEPGKXHHENRSPSASEL*SGXASSVADRAQVPWKGAPARVRAPSCPDPVAPRGAVGES 210 A EPG+ G A DR + GAP P P P P G G+ Sbjct: 1 AGEPGREGSPGADGPPGR--DGAAGVKGDRGETGALGAPGAPGPPGSPGPAGPTGKQGDR 58 Query: 211 GCLGMQPQSGGKFRPRLNTGERPIA 285 G G Q G P G R IA Sbjct: 59 GEAGAQ----GPMGPSGPAGARGIA 79 >gi|3242649|dbj|BAA29028.1| (AB015440) alpha 1 type I collagen [Rana catesbeiana] Length = 1445 Frame -1 hits (HSPs): ___ Frame -3 hits (HSPs): __ __________________________________________________ Database sequence: | | | | 1445 0 500 1000 Minus Strand HSPs: Score = 79 (27.8 bits), Expect = 0.10, Sum P(2) = 0.099 Identities = 24/49 (48%), Positives = 28/49 (57%), Frame = -1 Query: 281 IGLS-PVFSLG-RNLPPDWGCIPKQPDSPTAPRGATGS-GHDGALTLAGAP 138 +G S P S G R P + G I P P PRGA G+ G+DGA AGAP Sbjct: 652 VGPSGPAGSRGERGFPGERGAIG--PPGPQGPRGANGAPGNDGAKGEAGAP 700 Score = 35 (12.3 bits), Expect = 0.10, Sum P(2) = 0.099 Identities = 11/29 (37%), Positives = 13/29 (44%), Frame = -3 Query: 150 GWRPLPGDLGPVRH*GRXSRLQFGRRGRP 64 G LPG +GP GR + G G P Sbjct: 1139 GSNGLPGPIGPPGPRGRTGDV--GPAGPP 1165 >gi|115267|sp||CA11_BOVIN_2 [Segment 2 of 2] COLLAGEN ALPHA 1(I) CHAIN Length = 634 Frame 1 hits (HSPs): ______ Frame -1 hits (HSPs): ____ Frame -3 hits (HSPs): ____ __________________________________________________ Database sequence: | | | | | | 634 0 150 300 450 600 Plus Strand HSPs: Score = 78 (27.5 bits), Expect = 0.64, P = 0.47 Identities = 23/65 (35%), Positives = 28/65 (43%), Frame = +1 Query: 31 AXEPGKXHHENRSPSA--SEL*SGXASSVADRAQVPWKGAPARVRAPSCPDPVAPRGAVG 204 A PG+ E +P A S G + DR + GAP AP P PV P G G Sbjct: 423 AGPPGESGREG-APGAEGSPGRDGSPGAKGDRGETGPAGAPGPPGAPGAPGPVGPAGKSG 481 Query: 205 ESGCLG 222 + G G Sbjct: 482 DRGETG 487 Minus Strand HSPs: Score = 71 (25.0 bits), Expect = 0.12, Sum P(2) = 0.11 Identities = 18/38 (47%), Positives = 22/38 (57%), Frame = -1 Query: 251 RNLPPDWGCIPKQPDSPTAPRGATGS-GHDGALTLAGAP 138 R P + G + P P PRGA G+ G+DGA AGAP Sbjct: 99 RGFPGERGV--EGPPGPAGPRGANGAPGNDGAKGDAGAP 135 Score = 35 (12.3 bits), Expect = 0.12, Sum P(2) = 0.11 Identities = 11/29 (37%), Positives = 12/29 (41%), Frame = -3 Query: 150 GWRPLPGDLGPVRH*GRXSRLQFGRRGRP 64 G LPG +GP GR G G P Sbjct: 571 GLNGLPGPIGPPGPRGRTG--DAGPAGPP 597 >gi|4836662|gb|AAD30510.1|AF129925_4 (AF129925) CsoS2 [Acidithiobacillus ferrooxidans] Length = 766 Frame -1 hits (HSPs): _____ Frame -2 hits (HSPs): __ __________________________________________________ Database sequence: | | | | | | | 766 0 150 300 450 600 750 Minus Strand HSPs: Score = 72 (25.3 bits), Expect = 0.14, Sum P(2) = 0.13 Identities = 17/57 (29%), Positives = 27/57 (47%), Frame = -1 Query: 233 WGCIPKQPDSPTAPRGA-TGSGHDGALTLAGAPFQGTWARSATEDASPDYNSDAEGDR 63 +G +P + PR TG GH+G + GA ++ + + TE S N GD+ Sbjct: 666 YGAVPTTA-ATEVPRSRLTGDGHEGGFAITGAAWRRNESITGTEGTSTRRNQTLRGDQ 722 Score = 35 (12.3 bits), Expect = 0.14, Sum P(2) = 0.13 Identities = 6/15 (40%), Positives = 8/15 (53%), Frame = -2 Query: 295 GTCSLSVSRQYLALD 251 GTC +YL+ D Sbjct: 325 GTCKAVTGTEYLSAD 339 >gi|227093|prf||1614239A collagen alpha1(V) 66-267 [Homo sapiens] Length = 202 Frame 1 hits (HSPs): __________________ __________________________________________________ Database sequence: | | | | || 202 0 50 100 150 200 Plus Strand HSPs: Score = 77 (27.1 bits), Expect = 0.15, P = 0.14 Identities = 22/68 (32%), Positives = 25/68 (36%), Frame = +1 Query: 43 GKXHHENRSPSASEL*SGXASSVADRAQVPWKGAPARVRAPSCPDPVAPRGAVGESGCLG 222 GK H + S G D QV +G P + P P P G GE G G Sbjct: 21 GKGHRGDPGLSGPPGPPGDDGEEGDDGQVGPRGLPGQPGPRGLPGPKGPPGVTGEPGAPG 80 Query: 223 MQPQSGGK 246 M Q G K Sbjct: 81 MDGQPGPK 88 >gi|263810|gb|AAB24972.1| collagen alpha chain [Riftia pachyptila=tube worms, Peptide, 1027 aa] Length = 1027 Frame -1 hits (HSPs): ___ Frame -3 hits (HSPs): __ __ __________________________________________________ Database sequence: | | | | | | | | 1027 0 150 300 450 600 750 900 Minus Strand HSPs: Score = 70 (24.6 bits), Expect = 0.17, Sum P(2) = 0.15 Identities = 17/36 (47%), Positives = 17/36 (47%), Frame = -1 Query: 248 NLPPDWGCIPKQPDSPTAPRGATGS-GHDGALTLAG 144 N PD G P P P PRG TG G DG L G Sbjct: 116 NQGPDGGPGPAGPSGPIGPRGQTGERGRDGKSGLPG 151 Score = 39 (13.7 bits), Expect = 0.17, Sum P(2) = 0.15 Identities = 10/24 (41%), Positives = 12/24 (50%), Frame = -3 Query: 135 PGDLGPVRH*GRXSRLQFGRRGRP 64 PGD+G H G G+RG P Sbjct: 285 PGDVGAPGHAGEA-----GKRGSP 303 Score = 34 (12.0 bits), Expect = 0.54, Sum P(2) = 0.42 Identities = 7/11 (63%), Positives = 7/11 (63%), Frame = -3 Query: 150 GWRPLPGDLGP 118 G R LPG GP Sbjct: 883 GQRGLPGAAGP 893 >gi|399170|sp|P30754|CAFF_RIFPA FIBRIL-FORMING COLLAGEN ALPHA CHAIN Length = 1027 Frame -1 hits (HSPs): ___ Frame -3 hits (HSPs): __ __ Annotated Domains: __________________________________________________ __________________________________________________ Database sequence: | | | | | | | | 1027 0 150 300 450 600 750 900 __________________ Annotated Domains: DOMO DM00042: FIBRILLARCOLLAGENCARBOXYL-TERMI 17..151 DOMO DM00019: FIBRILLARCOLLAGENCARBOXYL-TERMI 153..331 DOMO DM00042: FIBRILLARCOLLAGENCARBOXYL-TERMI 333..424 DOMO DM00019: FIBRILLARCOLLAGENCARBOXYL-TERMI 426..615 DOMO DM00042: FIBRILLARCOLLAGENCARBOXYL-TERMI 617..711 DOMO DM00019: FIBRILLARCOLLAGENCARBOXYL-TERMI 713..861 Entrez Domain: NONHELICAL REGION (N-TERMINAL). 1..12 Entrez Domain: TRIPLE-HELICAL REGION. 13..1023 Entrez Domain: NONHELICAL REGION (C-TERMINAL). 1024..1027 Entrez hydroxylation site: (PARTIAL). 21 Entrez hydroxylation site: (PARTIAL). 24 Entrez hydroxylation site 27 Entrez hydroxylation site 39 Entrez hydroxylation site: (PARTIAL). 53 Entrez hydroxylation site 54 Entrez hydroxylation site: (PARTIAL). 72 Entrez hydroxylation site 90 Entrez hydroxylation site 93 Entrez hydroxylation site: (PARTIAL). 123 Entrez hydroxylation site: (PARTIAL). 128 Entrez hydroxylation site 150 Entrez hydroxylation site: (PARTIAL). 161 Entrez hydroxylation site 162 Entrez hydroxylation site: (PARTIAL). 164 Entrez hydroxylation site 165 Entrez hydroxylation site 174 Entrez hydroxylation site 177 Entrez hydroxylation site 180 Entrez hydroxylation site 183 Entrez hydroxylation site 207 Entrez hydroxylation site 216 Entrez hydroxylation site 219 Entrez hydroxylation site 228 Entrez hydroxylation site 237 Entrez hydroxylation site: (PARTIAL). 243 Entrez hydroxylation site 249 Entrez hydroxylation site 255 Entrez hydroxylation site: (PARTIAL). 273 Entrez hydroxylation site: (PARTIAL). 276 Entrez hydroxylation site: (PARTIAL). 285 Entrez hydroxylation site: (PARTIAL). 291 Entrez hydroxylation site: (PARTIAL). 303 Entrez hydroxylation site 306 Entrez hydroxylation site 312 Entrez hydroxylation site 321 Entrez hydroxylation site 327 Entrez hydroxylation site 339 Entrez hydroxylation site 342 Entrez hydroxylation site: (PARTIAL). 348 Entrez hydroxylation site: (PARTIAL). 351 Entrez hydroxylation site 366 Entrez hydroxylation site 372 Entrez hydroxylation site 375 Entrez hydroxylation site: (PARTIAL). 381 Entrez hydroxylation site 387 Entrez hydroxylation site: (PARTIAL). 416 Entrez hydroxylation site 417 Entrez hydroxylation site 423 Entrez hydroxylation site 429 Entrez hydroxylation site 432 Entrez hydroxylation site 453 Entrez hydroxylation site 465 Entrez hydroxylation site 483 Entrez hydroxylation site: (PARTIAL). 500 Entrez hydroxylation site: (PARTIAL). 503 Entrez hydroxylation site: (PARTIAL). 506 Entrez hydroxylation site 513 Entrez hydroxylation site 525 Entrez hydroxylation site: (PARTIAL). 533 Entrez hydroxylation site: (PARTIAL). 536 Entrez hydroxylation site 540 Entrez hydroxylation site 546 Entrez hydroxylation site: (PARTIAL). 551 Entrez hydroxylation site 552 Entrez hydroxylation site 561 Entrez hydroxylation site 603 Entrez other site: IMPERFECTION IN THE GAA REPE 610 Entrez hydroxylation site: (PARTIAL). 621 Entrez hydroxylation site 627 Entrez hydroxylation site: (PARTIAL). 645 Entrez hydroxylation site: (PARTIAL). 647 Entrez hydroxylation site 648 Entrez hydroxylation site 663 Entrez hydroxylation site 708 Entrez hydroxylation site 711 Entrez hydroxylation site 714 Entrez hydroxylation site 717 Entrez hydroxylation site 723 Entrez hydroxylation site 744 Entrez hydroxylation site 759 Entrez hydroxylation site: (PARTIAL). 773 Entrez hydroxylation site 774 Entrez hydroxylation site 783 Entrez hydroxylation site 792 Entrez hydroxylation site: (PARTIAL). 815 Entrez hydroxylation site 816 Entrez hydroxylation site 843 Entrez hydroxylation site 849 Entrez hydroxylation site 855 Entrez hydroxylation site 861 Entrez hydroxylation site 867 Entrez hydroxylation site 888 Entrez hydroxylation site 894 Entrez hydroxylation site 903 Entrez hydroxylation site 915 Entrez hydroxylation site: (PARTIAL). 933 Entrez hydroxylation site 939 Entrez hydroxylation site 945 Entrez hydroxylation site: (PARTIAL). 954 Entrez hydroxylation site 963 Entrez hydroxylation site 966 Entrez hydroxylation site 984 Entrez hydroxylation site 990 Entrez hydroxylation site: (PARTIAL). 1010 Entrez hydroxylation site 1011 Entrez hydroxylation site: (PARTIAL). 1013 Entrez hydroxylation site 1014 Entrez hydroxylation site: (PARTIAL). 1016 Entrez hydroxylation site 1017 Entrez hydroxylation site: (PARTIAL). 1019 Entrez hydroxylation site 1020 PFAM Collagen: Collagen triple helix repeat ( 19..78 PFAM Collagen: Collagen triple helix repeat ( 79..138 PFAM Collagen: Collagen triple helix repeat ( 142..201 PFAM Collagen: Collagen triple helix repeat ( 205..264 PFAM Collagen: Collagen triple helix repeat ( 265..324 PFAM Collagen: Collagen triple helix repeat ( 325..384 PFAM Collagen: Collagen triple helix repeat ( 397..456 PFAM Collagen: Collagen triple helix repeat ( 457..516 PFAM Collagen: Collagen triple helix repeat ( 520..579 PFAM Collagen: Collagen triple helix repeat ( 583..642 PFAM Collagen: Collagen triple helix repeat ( 643..702 PFAM Collagen: Collagen triple helix repeat ( 706..765 PFAM Collagen: Collagen triple helix repeat ( 766..825 PFAM Collagen: Collagen triple helix repeat ( 838..897 PFAM Collagen: Collagen triple helix repeat ( 898..957 PFAM Collagen: Collagen triple helix repeat ( 958..1017 PRODOM PD193117: CAFF_RIFPA 32..50 PRODOM PD000007: CA14(90) CA24(88) CA13(40) 52..82 PRODOM PD000007: CA14(90) CA24(88) CA13(40) 116..147 PRODOM PD000007: CA14(90) CA24(88) CA13(40) 150..180 PRODOM PD026538: CAFF_RIFPA 182..217 PRODOM PD000007: CA14(90) CA24(88) CA13(40) 219..249 PRODOM PD055865: CAFF_RIFPA 251..335 PRODOM PD000007: CA14(90) CA24(88) CA13(40) 337..366 PRODOM PD000007: CA14(90) CA24(88) CA13(40) 372..403 PRODOM PD000007: CA14(90) CA24(88) CA13(40) 409..438 PRODOM PD000007: CA14(90) CA24(88) CA13(40) 453..483 PRODOM PD159171: CAFF_RIFPA 485..521 PRODOM PD000007: CA14(90) CA24(88) CA13(40) 523..553 PRODOM PD000277: CA14(15) Q26640(12) Q26639(11) 574..615 PRODOM PD000007: CA14(90) CA24(88) CA13(40) 620..652 PRODOM PD193705: CAFF_RIFPA 654..677 PRODOM PD000277: CA14(15) Q26640(12) Q26639(11) 679..714 PRODOM PD193158: CAFF_RIFPA 729..753 PRODOM PD000007: CA14(90) CA24(88) CA13(40) 755..792 PRODOM PD000007: CA14(90) CA24(88) CA13(40) 838..867 PRODOM PD000007: CA14(90) CA24(88) CA13(40) 888..919 PRODOM PD193594: CAFF_RIFPA 921..959 PRODOM PD000007: CA14(90) CA24(88) CA13(40) 961..991 __________________ Minus Strand HSPs: Score = 70 (24.6 bits), Expect = 0.17, Sum P(2) = 0.15 Identities = 17/36 (47%), Positives = 17/36 (47%), Frame = -1 Query: 248 NLPPDWGCIPKQPDSPTAPRGATGS-GHDGALTLAG 144 N PD G P P P PRG TG G DG L G Sbjct: 116 NQGPDGGPGPAGPSGPIGPRGQTGERGRDGKSGLPG 151 Score = 39 (13.7 bits), Expect = 0.17, Sum P(2) = 0.15 Identities = 10/24 (41%), Positives = 12/24 (50%), Frame = -3 Query: 135 PGDLGPVRH*GRXSRLQFGRRGRP 64 PGD+G H G G+RG P Sbjct: 285 PGDVGAPGHAGEA-----GKRGSP 303 Score = 34 (12.0 bits), Expect = 0.54, Sum P(2) = 0.42 Identities = 7/11 (63%), Positives = 7/11 (63%), Frame = -3 Query: 150 GWRPLPGDLGP 118 G R LPG GP Sbjct: 883 GQRGLPGAAGP 893 >gi|2119156|pir||S28774 collagen alpha chain - tube worm (Riftia pachyptila) (fragment) Length = 1027 Frame -1 hits (HSPs): ___ Frame -3 hits (HSPs): __ __ Annotated Domains: __________________________________________________ __________________________________________________ Database sequence: | | | | | | | | 1027 0 150 300 450 600 750 900 __________________ Annotated Domains: Entrez domain: amino-terminal telopeptide (frag 1..12 Entrez domain: collagenous 13..1023 Entrez domain: carboxyl-terminal telopeptide (f 1024..1027 Entrez modified site: 4-hydroxyproline (Pro) (p 21 Entrez modified site: 4-hydroxyproline (Pro) (p 24 Entrez modified site: 4-hydroxyproline (Pro) (p 123 Entrez modified site: 4-hydroxyproline (Pro) (p 243 Entrez modified site: 4-hydroxyproline (Pro) (p 273 Entrez modified site: 4-hydroxyproline (Pro) (p 276 Entrez modified site: 4-hydroxyproline (Pro) (p 285 Entrez modified site: 4-hydroxyproline (Pro) (p 291 Entrez modified site: 4-hydroxyproline (Pro) (p 303 Entrez modified site: 4-hydroxyproline (Pro) (p 348 Entrez modified site: 4-hydroxyproline (Pro) (p 381 Entrez modified site: 4-hydroxyproline (Pro) (p 621 Entrez modified site: 4-hydroxyproline (Pro) (p 645 Entrez modified site: 4-hydroxyproline (Pro) 27 Entrez modified site: 4-hydroxyproline (Pro) 39 Entrez modified site: 4-hydroxyproline (Pro) 54 Entrez modified site: 4-hydroxyproline (Pro) 72 Entrez modified site: 4-hydroxyproline (Pro) 90 Entrez modified site: 4-hydroxyproline (Pro) 93 Entrez modified site: 4-hydroxyproline (Pro) 128 Entrez modified site: 4-hydroxyproline (Pro) 150 Entrez modified site: 4-hydroxyproline (Pro) 162 Entrez modified site: 4-hydroxyproline (Pro) 165 Entrez modified site: 4-hydroxyproline (Pro) 174 Entrez modified site: 4-hydroxyproline (Pro) 177 Entrez modified site: 4-hydroxyproline (Pro) 180 Entrez modified site: 4-hydroxyproline (Pro) 207 Entrez modified site: 4-hydroxyproline (Pro) 216 Entrez modified site: 4-hydroxyproline (Pro) 219 Entrez modified site: 4-hydroxyproline (Pro) 228 Entrez modified site: 4-hydroxyproline (Pro) 237 Entrez modified site: 4-hydroxyproline (Pro) 249 Entrez modified site: 4-hydroxyproline (Pro) 255 Entrez modified site: 4-hydroxyproline (Pro) 306 Entrez modified site: 4-hydroxyproline (Pro) 312 Entrez modified site: 4-hydroxyproline (Pro) 321 Entrez modified site: 4-hydroxyproline (Pro) 327 Entrez modified site: 4-hydroxyproline (Pro) 339 Entrez modified site: 4-hydroxyproline (Pro) 366 Entrez modified site: 4-hydroxyproline (Pro) 372 Entrez modified site: 4-hydroxyproline (Pro) 375 Entrez modified site: 4-hydroxyproline (Pro) 387 Entrez modified site: 4-hydroxyproline (Pro) 417 Entrez modified site: 4-hydroxyproline (Pro) 423 Entrez modified site: 4-hydroxyproline (Pro) 429 Entrez modified site: 4-hydroxyproline (Pro) 432 Entrez modified site: 4-hydroxyproline (Pro) 453 Entrez modified site: 4-hydroxyproline (Pro) 465 Entrez modified site: 4-hydroxyproline (Pro) 483 Entrez modified site: 4-hydroxyproline (Pro) 500 Entrez modified site: 4-hydroxyproline (Pro) 503 Entrez modified site: 4-hydroxyproline (Pro) 506 Entrez modified site: 4-hydroxyproline (Pro) 513 Entrez modified site: 4-hydroxyproline (Pro) 525 Entrez modified site: 4-hydroxyproline (Pro) 533 Entrez modified site: 4-hydroxyproline (Pro) 536 Entrez modified site: 4-hydroxyproline (Pro) 540 Entrez modified site: 4-hydroxyproline (Pro) 552 Entrez modified site: 4-hydroxyproline (Pro) 561 Entrez modified site: 4-hydroxyproline (Pro) 603 Entrez modified site: 4-hydroxyproline (Pro) 627 Entrez modified site: 4-hydroxyproline (Pro) 648 Entrez modified site: 4-hydroxyproline (Pro) 663 Entrez modified site: 4-hydroxyproline (Pro) 708 Entrez modified site: 4-hydroxyproline (Pro) 711 Entrez modified site: 4-hydroxyproline (Pro) 714 Entrez modified site: 4-hydroxyproline (Pro) 717 Entrez modified site: 4-hydroxyproline (Pro) 723 Entrez modified site: 4-hydroxyproline (Pro) 744 Entrez modified site: 4-hydroxyproline (Pro) 759 Entrez modified site: 4-hydroxyproline (Pro) 774 Entrez modified site: 4-hydroxyproline (Pro) 783 Entrez modified site: 4-hydroxyproline (Pro) 792 Entrez modified site: 4-hydroxyproline (Pro) 816 Entrez modified site: 4-hydroxyproline (Pro) 843 Entrez modified site: 4-hydroxyproline (Pro) 849 Entrez modified site: 4-hydroxyproline (Pro) 855 Entrez modified site: 4-hydroxyproline (Pro) 861 Entrez modified site: 4-hydroxyproline (Pro) 867 Entrez modified site: 4-hydroxyproline (Pro) 888 Entrez modified site: 4-hydroxyproline (Pro) 894 Entrez modified site: 4-hydroxyproline (Pro) 915 Entrez modified site: 4-hydroxyproline (Pro) 945 Entrez modified site: 4-hydroxyproline (Pro) 954 Entrez modified site: 4-hydroxyproline (Pro) 963 Entrez modified site: 4-hydroxyproline (Pro) 966 Entrez modified site: 4-hydroxyproline (Pro) 984 Entrez modified site: 4-hydroxyproline (Pro) 990 Entrez modified site: 4-hydroxyproline (Pro) 1011 Entrez modified site: 4-hydroxyproline (Pro) 1014 Entrez modified site: 4-hydroxyproline (Pro) 1017 Entrez modified site: 4-hydroxyproline (Pro) 1020 Entrez modified site: 3-hydroxyproline (Pro) 53 Entrez modified site: 3-hydroxyproline (Pro) 161 Entrez modified site: 3-hydroxyproline (Pro) 165 Entrez modified site: 3-hydroxyproline (Pro) 416 Entrez modified site: 3-hydroxyproline (Pro) 551 Entrez modified site: 3-hydroxyproline (Pro) 647 Entrez modified site: 3-hydroxyproline (Pro) 773 Entrez modified site: 3-hydroxyproline (Pro) 815 Entrez modified site: 3-hydroxyproline (Pro) 1010 Entrez modified site: 3-hydroxyproline (Pro) 1013 Entrez modified site: 3-hydroxyproline (Pro) 1016 Entrez modified site: 3-hydroxyproline (Pro) 1019 Entrez modified site: 5-hydroxylysine (Lys) 96 Entrez modified site: 5-hydroxylysine (Lys) 108 Entrez modified site: 5-hydroxylysine (Lys) 192 Entrez modified site: 5-hydroxylysine (Lys) 261 Entrez modified site: 5-hydroxylysine (Lys) 279 Entrez modified site: 5-hydroxylysine (Lys) 573 Entrez modified site: 5-hydroxylysine (Lys) 612 Entrez modified site: 5-hydroxylysine (Lys) 657 Entrez modified site: 5-hydroxylysine (Lys) 738 Entrez modified site: 5-hydroxylysine (Lys) 765 Entrez modified site: 5-hydroxylysine (Lys) 810 Entrez modified site: 5-hydroxylysine (Lys) 927 Entrez modified site: 5-hydroxylysine (Lys) 936 Entrez binding site: carbohydrate (Lys) (covale 96 Entrez binding site: carbohydrate (Lys) (covale 108 Entrez binding site: carbohydrate (Lys) (covale 192 Entrez binding site: carbohydrate (Lys) (covale 261 Entrez binding site: carbohydrate (Lys) (covale 279 Entrez binding site: carbohydrate (Lys) (covale 573 Entrez binding site: carbohydrate (Lys) (covale 612 Entrez binding site: carbohydrate (Lys) (covale 657 Entrez binding site: carbohydrate (Lys) (covale 738 Entrez binding site: carbohydrate (Lys) (covale 765 Entrez binding site: carbohydrate (Lys) (covale 810 Entrez binding site: carbohydrate (Lys) (covale 927 Entrez binding site: carbohydrate (Lys) (covale 936 Entrez modified site: 5-hydroxylysine (Lys) 183 Entrez modified site: 5-hydroxylysine (Lys) 342 Entrez modified site: 5-hydroxylysine (Lys) 546 Entrez modified site: 5-hydroxylysine (Lys) 567 Entrez modified site: 5-hydroxylysine (Lys) 939 Entrez modified site: 5-hydroxylysine (Lys) (pa 351 Entrez modified site: 5-hydroxylysine (Lys) (pa 933 __________________ Minus Strand HSPs: Score = 70 (24.6 bits), Expect = 0.17, Sum P(2) = 0.15 Identities = 17/36 (47%), Positives = 17/36 (47%), Frame = -1 Query: 248 NLPPDWGCIPKQPDSPTAPRGATGS-GHDGALTLAG 144 N PD G P P P PRG TG G DG L G Sbjct: 116 NQGPDGGPGPAGPSGPIGPRGQTGERGRDGKSGLPG 151 Score = 39 (13.7 bits), Expect = 0.17, Sum P(2) = 0.15 Identities = 10/24 (41%), Positives = 12/24 (50%), Frame = -3 Query: 135 PGDLGPVRH*GRXSRLQFGRRGRP 64 PGD+G H G G+RG P Sbjct: 285 PGDVGAPGHAGEA-----GKRGSP 303 Score = 34 (12.0 bits), Expect = 0.54, Sum P(2) = 0.42 Identities = 7/11 (63%), Positives = 7/11 (63%), Frame = -3 Query: 150 GWRPLPGDLGP 118 G R LPG GP Sbjct: 883 GQRGLPGAAGP 893 >gi|7499772|pir||T21314 hypothetical protein F23H12.4 - Caenorhabditis elegans >gi|3876307|emb|CAA98942.1| (Z74472) predicted using Genefinder~contains similarity to Pfam domain: PF01391 (Collagen triple helix repeat (20 copies)), Score=84.3, E-value=8.1e-22, N=2; PF01484 (Nematode cuticle collagen N-terminal domain), Score=34.2, E-value=9.9e-07, N=1~cDNA E> Length = 301 Frame 1 hits (HSPs): _________________ __________________________________________________ Database sequence: | | | | | | || 301 0 50 100 150 200 250 300 Plus Strand HSPs: Score = 79 (27.8 bits), Expect = 0.18, P = 0.17 Identities = 31/94 (32%), Positives = 36/94 (38%), Frame = +1 Query: 19 PLVTAXEPGKXHHENRSPSASE-L*SGXASSVADRAQVPWKGAPARVRAPSCPDPVAPRG 195 P A PG E +P+ SE L G D G P P P P+G Sbjct: 193 PAGEAGAPGPAG-EPGTPAISEPLTPGAPGEPGDSGPPGPPGPPGAPGNDGPPGPPGPKG 251 Query: 196 AVGESGCLGMQPQSG--GKFRPRLNTGERPIANKY 294 A G G G+ QSG G P GE+ I KY Sbjct: 252 APGPDGPPGVDGQSGPPGPPGPAGTPGEKGICPKY 286 >gi|476420|pir||CGBO1S collagen alpha 1(I) chain - bovine (tentative sequence) (fragments) Length = 779 Frame 1 hits (HSPs): _____ Frame -1 hits (HSPs): ___ Frame -3 hits (HSPs): ___ Annotated Domains: _ __________________________________________________ Database sequence: | | | | | | | 779 0 150 300 450 600 750 __________________ Annotated Domains: Entrez modified site: pyrrolidone carboxylic ac 1 __________________ Plus Strand HSPs: Score = 78 (27.5 bits), Expect = 0.81, P = 0.56 Identities = 23/65 (35%), Positives = 28/65 (43%), Frame = +1 Query: 31 AXEPGKXHHENRSPSA--SEL*SGXASSVADRAQVPWKGAPARVRAPSCPDPVAPRGAVG 204 A PG+ E +P A S G + DR + GAP AP P PV P G G Sbjct: 568 AGPPGESGREG-APGAEGSPGRDGSPGAKGDRGETGPAGAPGPPGAPGAPGPVGPAGKSG 626 Query: 205 ESGCLG 222 + G G Sbjct: 627 DRGETG 632 Minus Strand HSPs: Score = 71 (25.0 bits), Expect = 0.19, Sum P(2) = 0.17 Identities = 18/38 (47%), Positives = 22/38 (57%), Frame = -1 Query: 251 RNLPPDWGCIPKQPDSPTAPRGATGS-GHDGALTLAGAP 138 R P + G + P P PRGA G+ G+DGA AGAP Sbjct: 244 RGFPGERGV--EGPPGPAGPRGANGAPGNDGAKGDAGAP 280 Score = 35 (12.3 bits), Expect = 0.19, Sum P(2) = 0.17 Identities = 11/29 (37%), Positives = 12/29 (41%), Frame = -3 Query: 150 GWRPLPGDLGPVRH*GRXSRLQFGRRGRP 64 G LPG +GP GR G G P Sbjct: 716 GLNGLPGPIGPPGPRGRTG--DAGPAGPP 742 >gi|7441988|pir||T16984 transcription factor homolog BTF3 - curled-leaved tobacco >gi|1666173|emb|CAA70323.1| (Y09106) transcription factor [Nicotiana plumbaginifolia] Length = 165 Frame 3 hits (HSPs): ____________________ __________________________________________________ Database sequence: | | | | | 165 0 50 100 150 Plus Strand HSPs: Score = 74 (26.0 bits), Expect = 0.20, P = 0.19 Identities = 21/65 (32%), Positives = 30/65 (46%), Frame = +3 Query: 102 VLSGGPGPSPLEGGASQGESPVVPGPCRTTRRCRRVGLFGNAAPIGR*IPSKAKYWRETD 281 V+SG P L+G +S SPV P + R R + + AP P A +E D Sbjct: 83 VVSGSPQTKKLQGYSSSNYSPVGPDNLESLREASRA-VPESRAPSANGAPEGAPALQEDD 141 Query: 282 SEQVP 296 ++VP Sbjct: 142 DDEVP 146 >gi|7516580|pir||C72637 hypothetical protein APE1554 - Aeropyrum pernix (strain K1) >gi|5105239|dbj|BAA80553.1| (AP000061) 104aa long hypothetical protein [Aeropyrum pernix] Length = 104 Frame -2 hits (HSPs): _________________________________ __________________________________________________ Database sequence: | | | | | | | 104 0 20 40 60 80 100 Minus Strand HSPs: Score = 71 (25.0 bits), Expect = 0.22, P = 0.20 Identities = 21/67 (31%), Positives = 34/67 (50%), Frame = -2 Query: 220 PNNPTRRQRLVVRQGPGTTGLS-PWLAPPSRGLGPGPPLR---------TXLQTTIRTPR 71 P + RR+R +R G G P++ P G GPGPP R + L++ ++P Sbjct: 19 PLHRLRRERRGLRAGAPWGGTPRPFVLRPDTGGGPGPPPRGGLRLPGHVSLLRSRSQSPP 78 Query: 70 ATDFHGG 50 + ++HGG Sbjct: 79 SHEYHGG 85 >gi|71405|pir||CGCH1S collagen alpha 1(I) chain - chicken (tentative sequence) (fragments) Length = 1042 Frame 1 hits (HSPs): ____ Frame -1 hits (HSPs): ___ Frame -3 hits (HSPs): __ Annotated Domains: _ __________________________________________________ Database sequence: | | | | | | | | 1042 0 150 300 450 600 750 900 __________________ Annotated Domains: Entrez modified site: pyrrolidone carboxylic ac 1 __________________ Plus Strand HSPs: Score = 75 (26.4 bits), Expect = 2.4, P = 0.91 Identities = 23/71 (32%), Positives = 28/71 (39%), Frame = +1 Query: 31 AXEPGKXHHENRSPSASEL*S--GXASSVADRAQVPWKGAPARVRAPSCPDPVAPRGAVG 204 A PG+ E +P A G A DR + G P AP P PV P G G Sbjct: 844 AGPPGEAGREG-APGAEGAPGRDGAAGPKGDRGETGPAGPPGAPGAPGAPGPVGPAGKNG 902 Query: 205 ESGCLGMQPQSG 240 + G G +G Sbjct: 903 DRGETGPAGPAG 914 Minus Strand HSPs: Score = 70 (24.6 bits), Expect = 0.28, Sum P(2) = 0.24 Identities = 18/38 (47%), Positives = 22/38 (57%), Frame = -1 Query: 251 RNLPPDWGCIPKQPDSPTAPRGATGS-GHDGALTLAGAP 138 R P + G + P P PRGA G+ G+DGA AGAP Sbjct: 517 RGFPGERGV--QGPPGPQGPRGANGAPGNDGAKGDAGAP 553 Score = 37 (13.0 bits), Expect = 0.28, Sum P(2) = 0.24 Identities = 11/30 (36%), Positives = 13/30 (43%), Frame = -3 Query: 150 GWRPLPGDLGPVRH*GRXSRL-QFGRRGRP 64 G LPG +GP GR + G G P Sbjct: 992 GLNGLPGPIGPPGPRGRTGEVGPVGPPGPP 1021 >gi|115397|sp|P08124|CC01_CAEEL CUTICLE COLLAGEN 1 >gi|84425|pir||A31219 collagen 1 - Caenorhabditis elegans >gi|6678|emb|CAA23463.1| (V00147) unnamed protein product [Caenorhabditis elegans] >gi|156258|gb|AAA27988.1| (J01047) collagen [Caenorhabditis elegans] Length = 296 Frame 1 hits (HSPs): _________________ Annotated Domains: __________________________________________________ __________________________________________________ Database sequence: | | | | | | | 296 0 50 100 150 200 250 __________________ Annotated Domains: DOMO DM00019: FIBRILLARCOLLAGENCARBOXYL-TERMI 103..269 Entrez Domain: TRIPLE-HELICAL REGION. 100..129 Entrez Domain: TRIPLE-HELICAL REGION. 148..174 Entrez Domain: TRIPLE-HELICAL REGION. 178..204 Entrez Domain: TRIPLE-HELICAL REGION. 213..278 PFAM Collagen: Collagen triple helix repeat ( 148..207 PFAM Collagen: Collagen triple helix repeat ( 213..272 PRODOM PD004226: 6..32 PRODOM PD000926: 34..97 PRODOM PD000540: H1(18) O76786(11) TONB(10) 100..232 PRODOM PD002391: 276..295 __________________ Plus Strand HSPs: Score = 77 (27.1 bits), Expect = 0.29, P = 0.25 Identities = 31/94 (32%), Positives = 35/94 (37%), Frame = +1 Query: 19 PLVTAXEPGKXHHENRSPSASE-L*SGXASSVADRAQVPWKGAPARVRAPSCPDPVAPRG 195 P A PG E +P+ SE L G D G P P P P+G Sbjct: 188 PAGEAGAPGPAG-EPGTPAISEPLTPGAPGEPGDSGPPGPPGPPGAPGNDGPPGPPGPKG 246 Query: 196 AVGESGCLGMQPQSG--GKFRPRLNTGERPIANKY 294 A G G G QSG G P GE+ I KY Sbjct: 247 APGPDGPPGADGQSGPPGPPGPAGTPGEKGICPKY 281 >gi|180882|gb|AAA52053.1| (M21353) alpha-2 type I collagen [Homo sapiens] Length = 54 Frame 3 hits (HSPs): __________________ Frame 1 hits (HSPs): _____________ __________________________________________________ Database sequence: | | | | 54 0 20 40 Plus Strand HSPs: Score = 42 (14.8 bits), Expect = 0.32, Sum P(2) = 0.27 Identities = 8/21 (38%), Positives = 12/21 (57%), Frame = +3 Query: 111 GGPGPSPLEGGASQGESPVVP 173 G PGP ++GG + + P P Sbjct: 13 GPPGPQGVQGGKGE-QGPAGP 32 Score = 38 (13.4 bits), Expect = 0.32, Sum P(2) = 0.27 Identities = 7/14 (50%), Positives = 8/14 (57%), Frame = +1 Query: 172 PDPVAPRGAVGESG 213 P P P G VG+ G Sbjct: 39 PGPSGPAGEVGKPG 52 >gi|5732934|gb|AAD49346.1| (AF169346) pro-alpha-1 type 1 collagen [Cavia porcellus] Length = 230 Frame 1 hits (HSPs): ________________ __________________________________________________ Database sequence: | | | | | | 230 0 50 100 150 200 Plus Strand HSPs: Score = 75 (26.4 bits), Expect = 0.32, P = 0.27 Identities = 24/71 (33%), Positives = 28/71 (39%), Frame = +1 Query: 31 AXEPGKXHHENRSPSA--SEL*SGXASSVADRAQVPWKGAPARVRAPSCPDPVAPRGAVG 204 A PG+ E SP A S G DR + G P AP P PV P G G Sbjct: 137 AGPPGESGREG-SPGAEGSPGRDGSPGPKGDRGETGPAGPPGAPGAPGAPGPVGPAGKNG 195 Query: 205 ESGCLGMQPQSG 240 + G G +G Sbjct: 196 DRGETGPAGPAG 207 >gi|115411|sp|P17657|CCDC_CAEEL CUTICLE COLLAGEN DPY-13 >gi|84436|pir||A31921 collagen dpy-13 precursor - Caenorhabditis elegans >gi|156270|gb|AAA27994.1| (M23559) collagen [Caenorhabditis elegans] >gi|1123099|gb|AAA83499.1| (U42437) coded for by C. elegans cDNA yk100c3.5; coded for by C. elegans cDNA yk58f10.5; coded for by C. elegans cDNA yk66g4.5; coded for by C. elegans cDNA cm9g8; coded for by C. elegans cDNA yk66g4.3; coded for by C. elegans cDNA yk58f10.3; coded for> Length = 302 Frame 2 hits (HSPs): ____ Frame 1 hits (HSPs): _________ Annotated Domains: __________________________________________________ __________________________________________________ Database sequence: | | | | | | || 302 0 50 100 150 200 250 300 __________________ Annotated Domains: DOMO DM00019: FIBRILLARCOLLAGENCARBOXYL-TERMI 109..275 Entrez Domain: TRIPLE-HELICAL REGION. 106..135 Entrez Domain: TRIPLE-HELICAL REGION. 154..210 Entrez Domain: TRIPLE-HELICAL REGION. 219..278 PFAM Collagen: Collagen triple helix repeat ( 154..213 PFAM Collagen: Collagen triple helix repeat ( 219..278 PRODOM PD004226: 6..32 PRODOM PD000926: 34..104 PRODOM PD014680: 119..143 PRODOM PD000007: CA14(90) CA24(88) CA13(40) 174..204 PRODOM PD196428: CCDC_CAEEL 206..280 PRODOM PD002391: 282..301 __________________ Plus Strand HSPs: Score = 67 (23.6 bits), Expect = 0.35, Sum P(2) = 0.30 Identities = 19/52 (36%), Positives = 20/52 (38%), Frame = +1 Query: 139 GAPARVRAPSCPDPVAPRGAVGESGCLGMQPQSGGKFRPRLNTGERPIANKY 294 G P P P P P G G+ G G P G P GER I KY Sbjct: 237 GQPGNDGTPGQPGPKGPPGPDGKPGADG-NPGQPGPVGPPGTPGERGICPKY 287 Score = 38 (13.4 bits), Expect = 0.35, Sum P(2) = 0.30 Identities = 9/18 (50%), Positives = 10/18 (55%), Frame = +2 Query: 104 PQWRTG-PKSPGRGRQPG 154 PQ G P PGR +PG Sbjct: 107 PQGAPGAPGKPGRPGKPG 124 >gi|7504118|pir||T22607 hypothetical protein F54B11.1 - Caenorhabditis elegans >gi|3877568|emb|CAA94141.1| (Z70208) contains similarity to Pfam domain: PF01391 (Collagen triple helix repeat (20 copies)), Score=25.2, E-value=5e-05, N=2; PF01484 (Nematode cuticle collagen N-terminal domain), Score=-2.7, E-value=0.59, N=1 [Caenorhabditis elegans] Length = 339 Frame 3 hits (HSPs): _______ ____ Frame 1 hits (HSPs): _______ __________________________________________________ Database sequence: | | | | 339 0 150 300 Plus Strand HSPs: Score = 65 (22.9 bits), Expect = 0.42, Sum P(2) = 0.34 Identities = 17/44 (38%), Positives = 20/44 (45%), Frame = +1 Query: 163 PSCPDPVAPRGAVGESGCLGMQPQSGGKFRPRLNTGERPIANKY 294 P P P P G G G G+ QSGG R GE+ + KY Sbjct: 284 PGLPGPPGPPGRPGSDGNPGVPGQSGGSRRH----GEKGVCPKY 323 Score = 44 (15.5 bits), Expect = 0.42, Sum P(2) = 0.34 Identities = 12/25 (48%), Positives = 15/25 (60%), Frame = +3 Query: 105 LSGGPG-PSPLEGGASQGESPVVPGP 179 L+G PG P + G Q S V+PGP Sbjct: 198 LNGQPGLPGGM-GPPGQSMSNVLPGP 222 Score = 37 (13.0 bits), Expect = 2.1, Sum P(2) = 0.88 Identities = 10/25 (40%), Positives = 13/25 (52%), Frame = +3 Query: 105 LSGGPGPSPLEGGASQGESPVVPGP 179 L G PG + GA+ P +PGP Sbjct: 266 LMGPPGLDA-QNGANGFGPPGLPGP 289 Score = 33 (11.6 bits), Expect = 5.3, Sum P(2) = 0.99 Identities = 8/22 (36%), Positives = 8/22 (36%), Frame = +3 Query: 111 GGPGPSPLEGGASQGESPVVPG 176 G PGP G P PG Sbjct: 224 GPPGPPGQPGNHGLAGQPGSPG 245 >gi|83714|pir||JN0254 qutH protein - Emericella nidulans Length = 378 Frame 3 hits (HSPs): ____________ Annotated Domains: ______ __________________________________________________ Database sequence: | | | | 378 0 150 300 __________________ Annotated Domains: Entrez region: zinc binding 181..213 __________________ Plus Strand HSPs: Score = 77 (27.1 bits), Expect = 0.42, P = 0.34 Identities = 25/83 (30%), Positives = 37/83 (44%), Frame = +3 Query: 18 SLSNGXRTGKXPP*KSVALGVRIV--VWRXVLSGGPGPSPLEGGASQGESPVVP-GPCRT 188 S S RT PP K+V L +R + L PSPL GE+P +P P ++ Sbjct: 209 SCSACARTRNIPPRKAVLLTLRFASGIVGTFLYCDATPSPLNFETGTGENPTIPPAPSKS 268 Query: 189 TRRCRRVGLFGNAAPIGR*IPSKAKY 266 C R+ G A + +P ++ Sbjct: 269 ASECYRI--LGTRASLS--VPDMTRW 290 >gi|192264|gb|AAA37334.1| (M17491) procollagen type I alpha chain [Mus musculus] Length = 396 Frame 1 hits (HSPs): __________ __________________________________________________ Database sequence: | | | | 396 0 150 300 Plus Strand HSPs: Score = 77 (27.1 bits), Expect = 0.45, P = 0.36 Identities = 24/71 (33%), Positives = 29/71 (40%), Frame = +1 Query: 31 AXEPGKXHHENRSPSA--SEL*SGXASSVADRAQVPWKGAPARVRAPSCPDPVAPRGAVG 204 A PG+ E SP A S G + DR + G P AP P PV P G G Sbjct: 261 AGPPGESGREG-SPGAEGSPGRDGAPGAKGDRGETGPAGPPGAPGAPGAPGPVGPAGKNG 319 Query: 205 ESGCLGMQPQSG 240 + G G +G Sbjct: 320 DRGETGPAGPAG 331 >gi|3288493|emb|CAA75878.1| (Y15918) COL1A1 and PDGFB fusion transcript [Homo sapiens] Length = 173 Frame -1 hits (HSPs): ___________ __________________________________________________ Database sequence: | | | | | 173 0 50 100 150 Minus Strand HSPs: Score = 71 (25.0 bits), Expect = 0.51, P = 0.40 Identities = 18/38 (47%), Positives = 22/38 (57%), Frame = -1 Query: 251 RNLPPDWGCIPKQPDSPTAPRGATGS-GHDGALTLAGAP 138 R P + G + P P PRGA G+ G+DGA AGAP Sbjct: 81 RGFPGERGV--QGPPGPAGPRGANGAPGNDGAKGDAGAP 117 >gi|10440430|dbj|BAB15748.1| (AK024458) FLJ00050 protein [Homo sapiens] Length = 270 Frame -2 hits (HSPs): _______________ __________________________________________________ Database sequence: | | | | | | | 270 0 50 100 150 200 250 Minus Strand HSPs: Score = 74 (26.0 bits), Expect = 0.54, P = 0.42 Identities = 28/80 (35%), Positives = 35/80 (43%), Frame = -2 Query: 262 LALDGIYRPIGAAFPNNPTRRQRLVVRQGPGTTGLSPWLAP-PSRGLGPGPPLRTXLQTT 86 LAL + P+ P + LV+ Q PG TGLSPW P PS G P Q Sbjct: 165 LALPSPPAQLQGLMPSAPQDKS-LVLPQ-PGLTGLSPWRRPRPSST--KGLPQNPGQQAA 220 Query: 85 IRTPRATDFHGGXFPVRSPLL 23 + + FPVRS L+ Sbjct: 221 LWVAQRIKMWPPCFPVRSGLV 241 >gi|115268|sp|P02457|CA11_CHICK COLLAGEN ALPHA 1(I) CHAIN PRECURSOR Length = 1453 Frame 1 hits (HSPs): ___ Frame -1 hits (HSPs): ___ Frame -3 hits (HSPs): __ Annotated Domains: __________________________________________________ __________________________________________________ Database sequence: | | | | 1453 0 500 1000 __________________ Annotated Domains: BLOCKS BL01208C: VWFC domain proteins. 58..68 BLOCKS BL01208B: VWFC domain proteins. 74..88 BLOCKS BL01208A: VWFC domain proteins. 1385..1391 DOMO DM00551: VONWILLEBRANDFACTORTYPECREPEAT 1..97 DOMO DM00042: FIBRILLARCOLLAGENCARBOXYL-TERMI 98..141 DOMO DM00019: FIBRILLARCOLLAGENCARBOXYL-TERMI 142..336 DOMO DM00019: FIBRILLARCOLLAGENCARBOXYL-TERMI 338..478 DOMO DM00019: FIBRILLARCOLLAGENCARBOXYL-TERMI 480..668 DOMO DM00042: FIBRILLARCOLLAGENCARBOXYL-TERMI 670..849 DOMO DM00019: FIBRILLARCOLLAGENCARBOXYL-TERMI 915..1139 DOMO DM01418: FIBRILLARCOLLAGENCARBOXYL-TERMI 1273..1307 DOMO DM00860: FIBRILLARCOLLAGENCARBOXYL-TERMI 1309..1452 Entrez Domain: VWFC. 31..89 Entrez pyrrolidone-carboxylic-acid site 152 Entrez hydroxylation site: (POTENTIAL). 254 Entrez hydroxylation site: (POTENTIAL). 851 Entrez hydroxylation site: (POTENTIAL). 1081 Entrez hydroxylation site: (POTENTIAL). 1097 Entrez hydroxylation site: (ONLY 3-HYDROXYPRO A 1153 PFAM vwc: von Willebrand factor type C domain 33..88 PFAM Collagen: Collagen triple helix repeat ( 100..158 PFAM Collagen: Collagen triple helix repeat ( 166..224 PFAM Collagen: Collagen triple helix repeat ( 225..284 PFAM Collagen: Collagen triple helix repeat ( 285..344 PFAM Collagen: Collagen triple helix repeat ( 345..404 PFAM Collagen: Collagen triple helix repeat ( 405..464 PFAM Collagen: Collagen triple helix repeat ( 465..524 PFAM Collagen: Collagen triple helix repeat ( 525..584 PFAM Collagen: Collagen triple helix repeat ( 585..644 PFAM Collagen: Collagen triple helix repeat ( 645..704 PFAM Collagen: Collagen triple helix repeat ( 705..764 PFAM Collagen: Collagen triple helix repeat ( 768..827 PFAM Collagen: Collagen triple helix repeat ( 828..887 PFAM Collagen: Collagen triple helix repeat ( 888..947 PFAM Collagen: Collagen triple helix repeat ( 948..1007 PFAM Collagen: Collagen triple helix repeat ( 1008..1067 PFAM Collagen: Collagen triple helix repeat ( 1068..1127 PFAM Collagen: Collagen triple helix repeat ( 1128..1187 PFAM COLFI: Fibrillar collagen C-terminal dom 1234..1452 PRODOM PD162416: CA11_CHICK 1..24 PRODOM PD000826: NEL(12) Q17429(6) NOV(5) 26..104 PRODOM PD058205: CA11_CHICK 106..124 PRODOM PD026094: CA11_CHICK 148..175 PRODOM PD000007: CA14(90) CA24(88) CA13(40) 177..207 PRODOM PD018836: CA11(3) 241..267 PRODOM PD000007: CA14(90) CA24(88) CA13(40) 269..303 PRODOM PD000540: H1(18) O76786(11) TONB(10) 306..472 PRODOM PD000540: H1(18) O76786(11) TONB(10) 491..663 PRODOM PD000007: CA14(90) CA24(88) CA13(40) 683..713 PRODOM PD000540: H1(18) O76786(11) TONB(10) 735..852 PRODOM PD000007: CA14(90) CA24(88) CA13(40) 856..888 PRODOM PD187138: CA11(3) O76045(1) 929..960 PRODOM PD000007: CA14(90) CA24(88) CA13(40) 962..992 PRODOM PD002493: CA11(3) CA12(2) 1000..1027 PRODOM PD007903: 1079..1102 PRODOM PD000007: CA14(90) CA24(88) CA13(40) 1104..1133 PRODOM PD038429: CA11(3) O76045(1) Q9YIB4(1) 1185..1232 PRODOM PD002078: CA11(3) CA12(2) CA13(2) 1234..1451 __________________ Plus Strand HSPs: Score = 75 (26.4 bits), Expect = 3.4, P = 0.97 Identities = 23/71 (32%), Positives = 28/71 (39%), Frame = +1 Query: 31 AXEPGKXHHENRSPSASEL*S--GXASSVADRAQVPWKGAPARVRAPSCPDPVAPRGAVG 204 A PG+ E +P A G A DR + G P AP P PV P G G Sbjct: 995 AGPPGEAGREG-APGAEGAPGRDGAAGPKGDRGETGPAGPPGAPGAPGAPGPVGPAGKNG 1053 Query: 205 ESGCLGMQPQSG 240 + G G +G Sbjct: 1054 DRGETGPAGPAG 1065 Minus Strand HSPs: Score = 70 (24.6 bits), Expect = 0.55, Sum P(2) = 0.42 Identities = 18/38 (47%), Positives = 22/38 (57%), Frame = -1 Query: 251 RNLPPDWGCIPKQPDSPTAPRGATGS-GHDGALTLAGAP 138 R P + G + P P PRGA G+ G+DGA AGAP Sbjct: 668 RGFPGERGV--QGPPGPQGPRGANGAPGNDGAKGDAGAP 704 Score = 37 (13.0 bits), Expect = 0.55, Sum P(2) = 0.42 Identities = 11/30 (36%), Positives = 13/30 (43%), Frame = -3 Query: 150 GWRPLPGDLGPVRH*GRXSRL-QFGRRGRP 64 G LPG +GP GR + G G P Sbjct: 1143 GLNGLPGPIGPPGPRGRTGEVGPVGPPGPP 1172 >gi|2136732|pir||I45876 collagen alpha 1(II) chain - bovine (fragment) >gi|457187|gb|AAA30436.1| (L28918) cyanogen bromide [Bos taurus] Length = 93 Frame 1 hits (HSPs): __________________ __________________________________________________ Database sequence: | | | | | | 93 0 20 40 60 80 Plus Strand HSPs: Score = 67 (23.6 bits), Expect = 0.59, P = 0.45 Identities = 15/34 (44%), Positives = 18/34 (52%), Frame = +1 Query: 139 GAPARVRAPSCPDPVAPRGAVGESGCLGMQPQSG 240 GAP AP P P P G G +G LG + Q+G Sbjct: 59 GAPGIAGAPGFPGPRGPPGPQGATGPLGPKGQTG 92 >gi|159960|gb|AAA29439.1| (M24558) collagen-like protein [Paracentrotus lividus] Length = 290 Frame 1 hits (HSPs): _____________ __________________________________________________ Database sequence: | | | | | | | 290 0 50 100 150 200 250 Plus Strand HSPs: Score = 74 (26.0 bits), Expect = 0.61, P = 0.45 Identities = 23/72 (31%), Positives = 28/72 (38%), Frame = +1 Query: 40 PGKXHHEN-RSPSASEL*SGXASSVADRAQVPWKGAPARVRAPSCPDPVAPRGAVGESGC 216 PG R P S +G V +R + G P AP P P RG GE G Sbjct: 108 PGSQGESGERGPRGSVGPAGPPGGVGERGPM---GPPGMSGAPGAPGPKGDRGLPGERGA 164 Query: 217 LGMQPQSGGKFRP 255 G + +G RP Sbjct: 165 NGPKGSAGESGRP 177 >gi|7510834|pir||T28999 hypothetical protein ZC513.8 - Caenorhabditis elegans >gi|1255433|gb|AAC48270.1| (U53155) Similar to cuticular collagen; coded for by C. elegans cDNA yk58e6.3; coded for by C. elegans cDNA yk71h12.3; coded for by C. elegans cDNA yk100d10.3; coded for by C. elegans cDNA yk100d4.3; coded for by C. elegans cDNA yk123g7.3; coded for by> Length = 303 Frame 1 hits (HSPs): ________________ __________________________________________________ Database sequence: | | | | | | || 303 0 50 100 150 200 250 300 Plus Strand HSPs: Score = 74 (26.0 bits), Expect = 0.65, P = 0.48 Identities = 27/90 (30%), Positives = 37/90 (41%), Frame = +1 Query: 31 AXEPGKXHHENRSPSASE-L*SGXASSVADRAQVPWKGAPARVRAPSCPDPVAPRGAVGE 207 A +PG + +P+ SE L G D G P P P P+G+ G Sbjct: 199 AGQPGP-QGDAGTPAQSEPLTPGAPGEAGDAGPAGPPGPPGAPGNDGPPGPPGPKGSPGP 257 Query: 208 SGCLGMQPQSGGKFRP-RLNT-GERPIANKY 294 G G+ Q+G P + T GE+ I KY Sbjct: 258 DGPAGVDGQAGPPGPPGQAGTPGEKGICPKY 288 >gi|930045|emb|CAA33387.1| (X15332) alpha-1 (III) collagen [Homo sapiens] Length = 1078 Frame 3 hits (HSPs): ___ Frame 2 hits (HSPs): __ Frame 1 hits (HSPs): ____ __________________________________________________ Database sequence: | | | | | | | | | 1078 0 150 300 450 600 750 900 1050 Plus Strand HSPs: Score = 68 (23.9 bits), Expect = 0.70, Sum P(2) = 0.50 Identities = 23/55 (41%), Positives = 24/55 (43%), Frame = +1 Query: 130 PWKGAPARVR-APSCPDPVAPRGAVGESGCLGMQPQSG-----GKFRPRLNTGER 276 P PA + A P P PRG VG SG G SG G PR N GER Sbjct: 971 PGSPGPAGQQGAIGSPGPAGPRGPVGPSGPPGKDGTSGHPGPIGPPGPRGNRGER 1025 Score = 41 (14.4 bits), Expect = 0.70, Sum P(2) = 0.50 Identities = 13/43 (30%), Positives = 17/43 (39%), Frame = +3 Query: 30 GXRTGKXPP*KSVALGVRIVVWRXVLSGGPG-PSPLEGGASQG 155 G + PP LG+ + L+G PG P P QG Sbjct: 786 GSPGAQGPPGAPGPLGIAGITGARGLAGPPGMPGPRGSPGPQG 828 Score = 35 (12.3 bits), Expect = 6.0, Sum P(2) = 1.0 Identities = 8/19 (42%), Positives = 9/19 (47%), Frame = +2 Query: 80 PNCSLEXRPQWRTGPKSPG 136 P + E PQ GP PG Sbjct: 461 PGKNGEYGPQGPPGPTGPG 479 >gi|553198|gb|AAA51816.1| (M31731) bcl-3 protein [Homo sapiens] Length = 77 Frame 1 hits (HSPs): ________________________ __________________________________________________ Database sequence: | | | | | 77 0 20 40 60 Plus Strand HSPs: Score = 66 (23.2 bits), Expect = 0.75, P = 0.53 Identities = 18/41 (43%), Positives = 24/41 (58%), Frame = +1 Query: 121 AQVPWKGAPARVRAPSCPDPVAPRGAVGESGCLGMQPQSGG 243 A +P + P +RAPS P+P APRGA G + + P GG Sbjct: 20 AALPLRKRP--LRAPS-PEPAAPRGAAGL--VVPLDPLRGG 55 >gi|5921192|sp|P02467|CA21_CHICK COLLAGEN ALPHA 2(I) CHAIN PRECURSOR Length = 1362 Frame 2 hits (HSPs): __ Frame 1 hits (HSPs): ___ Annotated Domains: _ _ __________________________________________________ Database sequence: | | | | 1362 0 500 1000 __________________ Annotated Domains: Entrez modified site: CONVERTED TO AN ALDEHYDE 83 Entrez hydroxylation site 439 Entrez hydroxylation site 442 __________________ Plus Strand HSPs: Score = 69 (24.3 bits), Expect = 0.76, Sum P(2) = 0.53 Identities = 18/47 (38%), Positives = 20/47 (42%), Frame = +1 Query: 136 KGAPARVRAPSCPDPVAPRGAVGESGCLGMQPQSGGKFRPRL--NTG 270 KG P V P P P G GE G G+ G K P L +TG Sbjct: 619 KGEPGNVGPAGAPGPAGPGGIPGERGVAGVPGGKGEKGAPGLRGDTG 665 Score = 36 (12.7 bits), Expect = 0.76, Sum P(2) = 0.53 Identities = 10/31 (32%), Positives = 13/31 (41%), Frame = +2 Query: 65 GRPRRPNCSLEXRPQWRTGPKSP-GRGRQPG 154 G P P + PQ GP P G+ + G Sbjct: 114 GVPGEPGEPGQTGPQGPRGPPGPPGKAGEDG 144 >gi|192262|gb|AAA37333.1| (M14423) pro-alpha-1 type I collagen [Mus musculus] >gi|224870|prf||1202297A collagen alpha1(I),pro [Mus musculus] Length = 611 Frame 1 hits (HSPs): ______ __________________________________________________ Database sequence: | | | | || 611 0 150 300 450 600 Plus Strand HSPs: Score = 77 (27.1 bits), Expect = 0.78, P = 0.54 Identities = 24/71 (33%), Positives = 29/71 (40%), Frame = +1 Query: 31 AXEPGKXHHENRSPSA--SEL*SGXASSVADRAQVPWKGAPARVRAPSCPDPVAPRGAVG 204 A PG+ E SP A S G + DR + G P AP P PV P G G Sbjct: 478 AGPPGESGREG-SPGAEGSPGRDGAPGAKGDRGETGPAGPPGAPGAPGAPGPVGPAGKNG 536 Query: 205 ESGCLGMQPQSG 240 + G G +G Sbjct: 537 DRGETGPAGPAG 548 >gi|8922952 ref|NP_060836.1| hypothetical protein FLJ11230 [Homo sapiens] >gi|7023765|dbj|BAA92080.1| (AK002092) unnamed protein product [Homo sapiens] Length = 217 Frame -1 hits (HSPs): ____________ __________________________________________________ Database sequence: | | | | | | 217 0 50 100 150 200 Minus Strand HSPs: Score = 71 (25.0 bits), Expect = 0.80, P = 0.55 Identities = 14/48 (29%), Positives = 22/48 (45%), Frame = -1 Query: 236 DWGCIPKQPDSPTAPRGATGSGHDGALTLAGAPFQGTWARSATEDASP 93 D G +P+ P+GA SG G ++ + + G W ED +P Sbjct: 7 DGGALPEGLAEEAEPQGAAWSGDSGTVSQSHSSASGPWEDEGAEDGAP 54 >gi|2143752|pir||I60384 gene T1 protein - rat (fragment) >gi|531376|emb|CAA56213.1| (X79816) T1 [Rattus norvegicus] Length = 53 Frame 2 hits (HSPs): _________________ Frame 1 hits (HSPs): ____________ __________________________________________________ Database sequence: | | | | 53 0 20 40 Plus Strand HSPs: Score = 41 (14.4 bits), Expect = 0.85, Sum P(2) = 0.57 Identities = 9/18 (50%), Positives = 11/18 (61%), Frame = +2 Query: 104 PQWRTGPKSP-GRGRQPG 154 PQ TGP P G+ +PG Sbjct: 5 PQGATGPLGPKGQTGEPG 22 Score = 35 (12.3 bits), Expect = 0.85, Sum P(2) = 0.57 Identities = 6/12 (50%), Positives = 8/12 (66%), Frame = +1 Query: 178 PVAPRGAVGESG 213 P P+GA G +G Sbjct: 38 PAGPQGAPGPAG 49 >gi|2119160|pir||I50629 collagen - chicken (fragment) >gi|63308|emb|CAA23695.1| (V00401) collagen [Gallus gallus] Length = 473 Frame 1 hits (HSPs): ________ __________________________________________________ Database sequence: | | | | | 473 0 150 300 450 Plus Strand HSPs: Score = 75 (26.4 bits), Expect = 0.93, P = 0.61 Identities = 23/71 (32%), Positives = 28/71 (39%), Frame = +1 Query: 31 AXEPGKXHHENRSPSASEL*S--GXASSVADRAQVPWKGAPARVRAPSCPDPVAPRGAVG 204 A PG+ E +P A G A DR + G P AP P PV P G G Sbjct: 15 AGPPGEAGREG-APGAEGAPGRDGAAGPKGDRGETGPAGPPGAPGAPGAPGPVGPAGKNG 73 Query: 205 ESGCLGMQPQSG 240 + G G +G Sbjct: 74 DRGETGPAGPAG 85 >gi|7508879|pir||T28770 hypothetical protein W03D2.1 - Caenorhabditis elegans >gi|1947160|gb|AAC48255.1| (AF000298) weak similarity to collagens; glycine- and proline-rich [Caenorhabditis elegans] Length = 539 Frame 3 hits (HSPs): __ __ Frame 1 hits (HSPs): _____ __________________________________________________ Database sequence: | | | | | 539 0 150 300 450 Plus Strand HSPs: Score = 68 (23.9 bits), Expect = 0.99, Sum P(2) = 0.63 Identities = 21/50 (42%), Positives = 25/50 (50%), Frame = +1 Query: 130 PWKGAPARVRAPSCPDPVAPRGAVGESGCLGMQPQSGGKFRPRLNTGERP 279 P G+P RA S P P PRG+ +G L PQ+GG P TG P Sbjct: 305 PAGGSPPPPRAGSPPPPPPPRGSP-PTGSLP-PPQAGGS-PPPAGTGSPP 351 Score = 33 (11.6 bits), Expect = 0.99, Sum P(2) = 0.63 Identities = 6/9 (66%), Positives = 6/9 (66%), Frame = +3 Query: 30 GXRTGKXPP 56 G RTG PP Sbjct: 86 GGRTGSPPP 94 Score = 33 (11.6 bits), Expect = 0.99, Sum P(2) = 0.63 Identities = 6/9 (66%), Positives = 6/9 (66%), Frame = +3 Query: 30 GXRTGKXPP 56 G RTG PP Sbjct: 144 GGRTGSPPP 152 >gi|11437170 ref|XP_003597.1| hypothetical protein FLJ11230 [Homo sapiens] Length = 248 Frame -1 hits (HSPs): __________ __________________________________________________ Database sequence: | | | | | | 248 0 50 100 150 200 Minus Strand HSPs: Score = 71 (25.0 bits), Expect = 1.0, P = 0.63 Identities = 14/48 (29%), Positives = 22/48 (45%), Frame = -1 Query: 236 DWGCIPKQPDSPTAPRGATGSGHDGALTLAGAPFQGTWARSATEDASP 93 D G +P+ P+GA SG G ++ + + G W ED +P Sbjct: 38 DGGALPEGLAEEAEPQGAAWSGDSGTVSQSHSSASGPWEDEGAEDGAP 85 >gi|1340174|emb|CAA25821.1| (X01655) type III procollagen (aa 892-1023) [Homo sapiens] Length = 132 Frame 1 hits (HSPs): _____________________ __________________________________________________ Database sequence: | | | | 132 0 50 100 Plus Strand HSPs: Score = 68 (23.9 bits), Expect = 1.1, P = 0.66 Identities = 23/55 (41%), Positives = 24/55 (43%), Frame = +1 Query: 130 PWKGAPARVR-APSCPDPVAPRGAVGESGCLGMQPQSG-----GKFRPRLNTGER 276 P PA + A P P PRG VG SG G SG G PR N GER Sbjct: 54 PGSPGPAGQQGAIGSPGPAGPRGPVGPSGPPGKDGTSGHPGPIGPPGPRGNRGER 108 >gi|71415|pir||CGRT2S collagen alpha 2(I) chain - rat (tentative sequence) (fragments) Length = 184 Frame 3 hits (HSPs): _______ Frame 1 hits (HSPs): _______ Annotated Domains: __ __________________________________________________ Database sequence: | | | | | 184 0 50 100 150 __________________ Annotated Domains: Entrez modified site: blocked amino end (Glx) ( 1 Entrez modified site: allysine (Lys) 5 __________________ Plus Strand HSPs: Score = 54 (19.0 bits), Expect = 1.2, Sum P(2) = 0.70 Identities = 10/23 (43%), Positives = 12/23 (52%), Frame = +3 Query: 111 GGPGPSPLEGGASQGESPVVPGP 179 G PGP +G A + P PGP Sbjct: 27 GAPGPQGFQGPAGEPGEPGQPGP 49 Score = 54 (19.0 bits), Expect = 1.2, Sum P(2) = 0.70 Identities = 9/23 (39%), Positives = 14/23 (60%), Frame = +1 Query: 172 PDPVAPRGAVGESGCLGMQPQSG 240 P P+ P G G++G +G P +G Sbjct: 120 PGPIGPAGPRGZAGAIGF-PMTG 141 >gi|7494559|pir||T28887 collagen dpy-10 - Caenorhabditis elegans >gi|1213522|gb|AAA91236.1| (U50191) C. elegans collagen dpy-10 gene (Levy, A.D., Yang, J. and Kramer, J.M. Mol. Biol Cell 4, 803-17, 1993) [Caenorhabditis elegans] Length = 284 Frame 1 hits (HSPs): __________ __________________________________________________ Database sequence: | | | | | | | 284 0 50 100 150 200 250 Plus Strand HSPs: Score = 71 (25.0 bits), Expect = 1.2, P = 0.71 Identities = 18/56 (32%), Positives = 23/56 (41%), Frame = +1 Query: 139 GAPARVRAPSC--PDPVAPRGAVGESGCLGMQPQSGGKFRPRLNTGERPIANKYRE 300 GAP P C P P PRG+ G G G+ +G P + N+ RE Sbjct: 93 GAPLETECPGCCIPGPPGPRGSSGTPGKPGLPGNAGKPGMPGTTPNQTCPLNQVRE 148 >gi|4502951 ref|NP_000081.1| collagen, type III, alpha 1; Collagen III, alpha-1 polypeptide [Homo sapiens] >gi|115306|sp|P02461|CA13_HUMAN COLLAGEN ALPHA 1(III) CHAIN PRECURSOR >gi|30058|emb|CAA32583.1| (X14420) prepro-alpha-1 type 3 collagen [Homo sapiens] Length = 1466 Frame 3 hits (HSPs): ___ Frame 2 hits (HSPs): __ Frame 1 hits (HSPs): __ __________________________________________________ Database sequence: | | | | 1466 0 500 1000 Plus Strand HSPs: Score = 68 (23.9 bits), Expect = 1.3, Sum P(2) = 0.73 Identities = 23/55 (41%), Positives = 24/55 (43%), Frame = +1 Query: 130 PWKGAPARVR-APSCPDPVAPRGAVGESGCLGMQPQSG-----GKFRPRLNTGER 276 P PA + A P P PRG VG SG G SG G PR N GER Sbjct: 1118 PGSPGPAGQQGAIGSPGPAGPRGPVGPSGPPGKDGTSGHPGPIGPPGPRGNRGER 1172 Score = 41 (14.4 bits), Expect = 1.3, Sum P(2) = 0.73 Identities = 13/43 (30%), Positives = 17/43 (39%), Frame = +3 Query: 30 GXRTGKXPP*KSVALGVRIVVWRXVLSGGPG-PSPLEGGASQG 155 G + PP LG+ + L+G PG P P QG Sbjct: 933 GSPGAQGPPGAPGPLGIAGITGARGLAGPPGMPGPRGSPGPQG 975 Score = 36 (12.7 bits), Expect = 8.9, Sum P(2) = 1.0 Identities = 8/19 (42%), Positives = 9/19 (47%), Frame = +2 Query: 80 PNCSLEXRPQWRTGPKSPG 136 P + E PQ GP PG Sbjct: 608 PGKNGETGPQGPPGPTGPG 626 >gi|1070603|pir||CGHU7L collagen alpha 1(III) chain precursor - human Length = 1466 Frame 3 hits (HSPs): ___ Frame 2 hits (HSPs): __ Frame 1 hits (HSPs): __ Annotated Domains: __________________________________________________ __________________________________________________ Database sequence: | | | | 1466 0 500 1000 __________________ Annotated Domains: Entrez domain: signal sequence 1..23 Entrez domain: amino-terminal propeptide 24..153 Entrez domain: von Willebrand factor type C rep 31..91 Entrez region: amino-terminal nonhelical telope 154..167 Entrez region: helical 168..1196 Entrez region: cell attachment (R-G-D) motif 1091..1093 Entrez region: carboxyl-terminal nonhelical tel 1197..1221 Entrez domain: carboxyl-terminal propeptide 1222..1466 Entrez domain: fibrillar collagen carboxyl-term 1238..1466 Entrez modified site: pyrrolidone carboxylic ac 24 Entrez cleavage site: Pro-Gln (procollagen N-en 153..154 Entrez modified site: pyrrolidone carboxylic ac 154 Entrez modified site: allysine (Lys) 161 Entrez modified site: allysine (Lys) 1212 Entrez modified site: 5-hydroxylysine (Lys) 263 Entrez modified site: 5-hydroxylysine (Lys) 284 Entrez modified site: 5-hydroxylysine (Lys) 860 Entrez modified site: 5-hydroxylysine (Lys) 977 Entrez modified site: 5-hydroxylysine (Lys) 1106 Entrez binding site: carbohydrate (Lys) (covale 263 Entrez modified site: 5-hydroxylysine (Lys) (pa 584 Entrez modified site: 5-hydroxylysine (Lys) (pa 1094 Entrez cleavage site: Gly-Ile (collagenase) 948..949 Entrez binding site: carbohydrate (Lys) (covale 1106 Entrez modified site: 3-hydroxyproline (Pro) 1162 Entrez cleavage site: Gly-Asp (procollagen C-en 1221..1222 Entrez binding site: carbohydrate (Asn) (covale 1367 PROSITE GRAM_POS_ANCHORING: Gram-positive cocci 643..648 __________________ Plus Strand HSPs: Score = 68 (23.9 bits), Expect = 1.3, Sum P(2) = 0.73 Identities = 23/55 (41%), Positives = 24/55 (43%), Frame = +1 Query: 130 PWKGAPARVR-APSCPDPVAPRGAVGESGCLGMQPQSG-----GKFRPRLNTGER 276 P PA + A P P PRG VG SG G SG G PR N GER Sbjct: 1118 PGSPGPAGQQGAIGSPGPAGPRGPVGPSGPPGKDGTSGHPGPIGPPGPRGNRGER 1172 Score = 41 (14.4 bits), Expect = 1.3, Sum P(2) = 0.73 Identities = 13/43 (30%), Positives = 17/43 (39%), Frame = +3 Query: 30 GXRTGKXPP*KSVALGVRIVVWRXVLSGGPG-PSPLEGGASQG 155 G + PP LG+ + L+G PG P P QG Sbjct: 933 GSPGAQGPPGAPGPLGIAGITGARGLAGPPGMPGPRGSPGPQG 975 Score = 36 (12.7 bits), Expect = 8.9, Sum P(2) = 1.0 Identities = 8/19 (42%), Positives = 9/19 (47%), Frame = +2 Query: 80 PNCSLEXRPQWRTGPKSPG 136 P + E PQ GP PG Sbjct: 608 PGKNGETGPQGPPGPTGPG 626 >gi|3171998|emb|CAA06510.1| (AJ005395) collagen alpha 1 (III) [Rattus norvegicus] Length = 564 Frame 1 hits (HSPs): _______ __________________________________________________ Database sequence: | | | | | 564 0 150 300 450 Plus Strand HSPs: Score = 74 (26.0 bits), Expect = 1.5, P = 0.77 Identities = 22/71 (30%), Positives = 26/71 (36%), Frame = +1 Query: 28 TAXEPGKXHHENRSPSASEL*SGXASSVADRAQVPWKGAPARVRAPSCPDPVAPRGAVGE 207 TA EPG+ + G DR + GAP P P PV P G G+ Sbjct: 106 TAGEPGRDGNPGSDGQPGR--DGSPGGKGDRGENGSPGAPGAPGHPGPPGPVGPSGKNGD 163 Query: 208 SGCLGMQPQSG 240 G G SG Sbjct: 164 RGETGPAGPSG 174 >gi|1814029|gb|AAB41793.1| (U84501) cuticle collagen [Caenorhabditis briggsae] Length = 316 Frame 3 hits (HSPs): ___ Frame 1 hits (HSPs): _______ __________________________________________________ Database sequence: | | | | | | | | 316 0 50 100 150 200 250 300 Plus Strand HSPs: Score = 66 (23.2 bits), Expect = 1.5, Sum P(2) = 0.79 Identities = 16/39 (41%), Positives = 19/39 (48%), Frame = +1 Query: 139 GAPARV-RAPSCPDPVAPRGAVGESGCLGMQPQSGGKFRP 255 GAP +V P P P P G G +G G QP G +P Sbjct: 230 GAPGQVVDVPGTPGPAGPPGPPGPAGAPG-QPGQAGSSQP 268 Score = 35 (12.3 bits), Expect = 1.5, Sum P(2) = 0.79 Identities = 8/16 (50%), Positives = 9/16 (56%), Frame = +3 Query: 105 LSGGPGPSPLEGGASQ 152 L G PGP+ G A Q Sbjct: 204 LPGPPGPAGPPGPAGQ 219 >gi|258774|gb|AAB23914.1| type II collagen alpha 1 chain, COL2A1 [human, Peptide Partial Mutant, 346 aa] Length = 346 Frame 1 hits (HSPs): __________ __________________________________________________ Database sequence: | | | | 346 0 150 300 Plus Strand HSPs: Score = 71 (25.0 bits), Expect = 1.7, P = 0.81 Identities = 20/66 (30%), Positives = 23/66 (34%), Frame = +1 Query: 31 AXEPGKXHHENRSPSASEL*SGXASSVADRAQVPWKGAPARVRAPSCPDPVAPRGAVGES 210 A EPG+ G A DR + GAP P P P P G G+ Sbjct: 280 AGEPGRQGSPGADGPPGR--DGAAEVKGDRGETGAVGAPGTPGPPGSPGPAGPTGKQGDR 337 Query: 211 GCLGMQ 228 G G Q Sbjct: 338 GEAGAQ 343 >gi|2388676|gb|AAB80719.1| (AF015539) precollagen P [Mytilus edulis] Length = 902 Frame 2 hits (HSPs): _____ Frame 1 hits (HSPs): ___ __________________________________________________ Database sequence: | | | | | | || 902 0 150 300 450 600 750 900 Plus Strand HSPs: Score = 64 (22.5 bits), Expect = 1.7, Sum P(2) = 0.81 Identities = 17/44 (38%), Positives = 19/44 (43%), Frame = +1 Query: 139 GAPARVRAPSCPDPVAPRGAVGESGCLGM---QPQSGGKFRPRL 261 G P AP P PRG +G SG G Q GG+ P L Sbjct: 420 GGPGDKGAPGTPGGTGPRGPIGPSGPSGAPGDQGPQGGRGTPGL 463 Score = 51 (18.0 bits), Expect = 1.7, Sum P(2) = 0.81 Identities = 15/41 (36%), Positives = 16/41 (39%), Frame = +2 Query: 32 RXNREXPTMKIGRPRRPNCSLEXRPQWRTGPKSPGRGRQPG 154 R P + G P RP S RP P PGR PG Sbjct: 262 RLGNPGPPGQPGNPGRPGSS--GRPGGSGQPGGPGRPGTPG 300 Score = 45 (15.8 bits), Expect = 6.7, Sum P(2) = 1.0 Identities = 12/31 (38%), Positives = 14/31 (45%), Frame = +2 Query: 65 GRPRRP-NCSLEXRPQWRTGPKSPGRGRQPG 154 G P +P N +P P PG G QPG Sbjct: 297 GTPGKPGNRGQPGQPGGPGQPGHPGAGGQPG 327 WARNING: HSPs involving 98 database sequences were not reported due to the limiting value of parameter B = 50. Parameters: filter=none matrix=BLOSUM62 V=50 B=50 E=10 gi H=1 sort_by_pvalue echofilter ctxfactor=5.93 Query ----- As Used ----- ----- Computed ---- Frame MatID Matrix name Lambda K H Lambda K H Std. 0 BLOSUM62 0.318 0.135 0.401 +3 0 BLOSUM62 0.318 0.135 0.401 0.325 0.144 0.472 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +2 0 BLOSUM62 0.318 0.135 0.401 0.350 0.154 0.561 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +1 0 BLOSUM62 0.316 0.134 0.417 same same same Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -1 0 BLOSUM62 0.315 0.135 0.448 same same same Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -2 0 BLOSUM62 0.318 0.135 0.401 0.321 0.142 0.448 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -3 0 BLOSUM62 0.318 0.135 0.401 0.350 0.157 0.586 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a Query Frame MatID Length Eff.Length E S W T X E2 S2 +3 0 100 95 10. 66 3 12 22 0.099 32 28 0.12 33 +2 0 100 96 10. 66 3 12 22 0.10 32 28 0.095 34 +1 0 100 95 10. 66 3 12 22 0.10 32 28 0.12 33 -1 0 100 96 10. 66 3 13 22 0.11 32 28 0.095 34 -2 0 100 96 10. 66 3 12 22 0.10 32 28 0.095 34 -3 0 100 97 10. 66 3 12 22 0.10 32 28 0.097 34 Statistics: Database: /usr/local/dot5/sl_home/beauty/seqdb/blast/nr Title: nr Release date: unknown Posted date: 4:06 PM CST Feb 28, 2001 Format: BLAST # of letters in database: 197,782,623 # of sequences in database: 625,274 # of database sequences satisfying E: 148 No. of states in DFA: 572 (56 KB) Total size of DFA: 130 KB (192 KB) Time to generate neighborhood: 0.01u 0.00s 0.01t Elapsed: 00:00:00 No. of threads or processors used: 6 Search cpu time: 124.11u 0.90s 125.01t Elapsed: 00:00:36 Total cpu time: 124.18u 0.96s 125.14t Elapsed: 00:00:37 Start: Wed Jan 16 12:28:01 2002 End: Wed Jan 16 12:28:38 2002 WARNINGS ISSUED: 2
Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000