WU-BLAST 2.0 search of the National Center for Biotechnology Information's NR Protein Database.
BEAUTY post-processing provided by the Human Genome Sequencing Center, Baylor College of Medicine.
BEAUTY Reference:
Worley KC, Culpepper P, Wiese BA, Smith RF. BEAUTY-X: enhanced BLAST searches for DNA queries. Bioinformatics 1998;14(10):890-1. Abstract
Worley KC, Wiese BA, Smith RF. BEAUTY: an enhanced BLAST-based search tool that integrates multiple biological information resources into sequence similarity search results. Genome Res 1995 Sep;5(2):173-84 Abstract
RepeatMasker repeats found in sequence:No Repeats Found.Reference: Gish, Warren (1994-1997). unpublished. Gish, Warren and David J. States (1993). Identification of protein coding regions by database similarity search. Nat. Genet. 3:266-72.Notice: statistical significance is estimated under the assumption that the equivalent of one entire reading frame in the query sequence codes for protein and that significant alignments will involve only coding reading frames.
Query= A01i19 (930 letters)
Translating both strands of query sequence in all 6 reading framesDatabase: nr 505,245 sequences; 158,518,215 total letters.Observed Numbers of Database Sequences Satisfying Various EXPECTation Thresholds (E parameter values) Histogram units: = 2 Sequences : less than 2 sequences EXPECTation Threshold (E parameter) | V Observed Counts--> 10000 457 107 |===================================================== 6310 350 55 |=========================== 3980 295 44 |====================== 2510 251 53 |========================== 1580 198 59 |============================= 1000 139 31 |=============== 631 108 28 |============== 398 80 17 |======== 251 63 7 |=== 158 56 18 |========= 100 38 16 |======== 63.1 22 1 |: 39.8 21 1 |: 25.1 20 1 |: 15.8 19 0 | >>>>>>>>>>>>>>>>>>>>> Expect = 10.0, Observed = 19 <<<<<<<<<<<<<<<<< 10.0 19 0 | 6.31 19 0 | 3.98 19 2 |= 2.51 17 2 |= 1.58 15 0 | 1.00 15 0 | 0.63 15 0 | 0.40 15 0 | 0.25 15 0 | 0.16 15 0 | 0.10 15 0 | 0.063 15 0 | 0.040 15 1 |: Smallest Sum Reading High Probability Sequences producing High-scoring Segment Pairs: Frame Score P(N) N gi|6671963|gb|AAF23222.1|AC013454_9(AC013454) unknown... +3 742 1.4e-72 1 gi|6714407|gb|AAF26096.1|AC012393_22(AC012393) unknow... +3 316 3.4e-46 2 gi|7485938|pir||T01201hypothetical protein F21E10.13 ... +3 404 9.3e-37 1 gi|7110635ref|NP_036396.1| chromosome 22 open reading... +3 335 1.9e-29 1 gi|6572254|emb|CAB63066.1|(AL020993) dJ5O6.2 (novel p... +3 335 1.9e-29 1 gi|7503052|pir||T21697hypothetical protein F40E10.6 -... +3 340 2.3e-29 1 gi|7292104|gb|AAF47516.1|(AE003472) CG12004 gene prod... +3 331 5.1e-29 1 gi|6572253|emb|CAB63065.1|(AL020993) dJ5O6.2 (novel p... +3 317 1.5e-27 1 gi|7297586|gb|AAF52840.1|(AE003626) CG5850 gene produ... +3 275 2.6e-22 1 gi|7023136|dbj|BAA91851.1|(AK001708) unnamed protein ... +3 242 4.5e-19 1 gi|1351659|sp|Q09906|YAJ6_SCHPOHYPOTHETICAL 49.3 KD P... +3 217 2.4e-16 1 gi|6322904ref|NP_012977.1| Ykr051wp >gi|549619|sp|P36... +3 168 3.6e-09 1 gi|7487004|pir||T00451hypothetical protein T14N5.8 - ... +3 153 2.1e-07 2 gi|7485716|pir||T05165hypothetical protein F18E5.190 ... +3 140 2.9e-06 1 gi|7496248|pir||T15541hypothetical protein C18A3.4 - ... +3 106 0.031 1 gi|7485951|pir||T05664hypothetical protein F22I13.130... +3 89 0.83 2 gi|7020081|dbj|BAA90988.1|(AK000169) unnamed protein ... +3 89 0.89 1 gi|7707666|dbj|BAA95343.1|(AB027560) ATPase subunit 6... +3 83 0.95 1 gi|625591|pir||D61399hypothetical early protein E8 - ... +3 66 0.98 1 Locally-aligned regions (HSPs) with respect to query sequence: Locus_ID Frame 3 Hits gi|6671963 | _____________________________________ gi|6714407 | _____________ _________________ gi|7485938 | _____________________________________ gi|7110635 | _____________________________________ gi|6572254 | _____________________________________ gi|7503052 | ___________________________________ gi|7292104 | ___________________________________ gi|6572253 | _________________________________ gi|7297586 | ____________________________________ gi|7023136 | ____________________________________ gi|1351659 | __________________________________ gi|6322904 | _____________________________________ gi|7487004 | _____________ ________ gi|7485716 | ___________________________________ gi|7496248 | ________________________ gi|7485951 | ___________________________________ gi|7020081 | ___________________ gi|7707666 | ________________________ gi|625591 | _______ __________________________________________________ Query sequence: | | | | | | | | 310 0 50 100 150 200 250 300
Use the and icons to retrieve links to Entrez:
>gi|6671963|gb|AAF23222.1|AC013454_9 (AC013454) unknown protein [Arabidopsis thaliana] Length = 422 Frame 3 hits (HSPs): ___________________________ __________________________________________________ Database sequence: | | | | 422 0 150 300 Plus Strand HSPs: Score = 742 (261.2 bits), Expect = 1.4e-72, P = 1.4e-72 Identities = 146/222 (65%), Positives = 166/222 (74%), Frame = +3 Query: 249 MVPIFLYIVAFICTCGAIALSLLHIYKHLLNYTEPTYQRFIVRIVFMVPVYALMSFLSLV 428 ++PI+L I+AF+CT GAIAL+L HIYKHLLNYTEP YQR+IVRIVFMVPVYALMSFL+LV Sbjct: 4 LLPIYLIILAFLCTVGAIALALFHIYKHLLNYTEPIYQRYIVRIVFMVPVYALMSFLALV 63 Query: 429 LPQGSIYFNSIREIYEAWVIYNFLSLCLEWVGGPGSXILSLTGRVLKPSWFLXNCCLPPL 608 LP+ SIYFNSIRE+YEAWVIYNFLSLCL WVGGPGS ++SLTGR LKPSW L CC+PPL Sbjct: 64 LPKSSIYFNSIREVYEAWVIYNFLSLCLAWVGGPGSVVISLTGRSLKPSWHLMTCCIPPL 123 Query: 609 ALDGGFIR*CKQGCLQFVNFETHLVVVTLYFYAKGKYKDGNFFQXIIIVS*QSLYFFLTQ 788 LDG FIR CKQGCLQFV + LV VTL YAKGKYKDGNF + +Y Sbjct: 124 PLDGRFIRRCKQGCLQFVILKPILVAVTLVLYAKGKYKDGNFSPDQSYLYLTIIYTISYT 183 Query: 789 WXXKALLXFXLACKXICFNNSSXPKFXXXKIVVFPTYGXGVL 914 AL+ F +ACK + + PKF K VVF TY GVL Sbjct: 184 VALYALVLFYVACKDLLQPFNPVPKFVIIKSVVFLTYWQGVL 225 >gi|6714407|gb|AAF26096.1|AC012393_22 (AC012393) unknown protein [Arabidopsis thaliana] Length = 372 Frame 3 hits (HSPs): ________________________ __________________________________________________ Database sequence: | | | | 372 0 150 300 Plus Strand HSPs: Score = 316 (111.2 bits), Expect = 3.4e-46, Sum P(2) = 3.4e-46 Identities = 59/74 (79%), Positives = 69/74 (93%), Frame = +3 Query: 249 MVPIFLYIVAFICTCGAIALSLLHIYKHLLNYTEPTYQRFIVRIVFMVPVYALMSFLSLV 428 ++PI+L I+AF+CT GAIAL+L HIYKHLLNYTEP YQR+IVRIVFMVPVYALMSFL+LV Sbjct: 4 LLPIYLIILAFLCTVGAIALALFHIYKHLLNYTEPIYQRYIVRIVFMVPVYALMSFLALV 63 Query: 429 LPQGSIYFNSIREI 470 LP+ SIYFNSIRE+ Sbjct: 64 LPKSSIYFNSIREV 77 Score = 194 (68.3 bits), Expect = 3.4e-46, Sum P(2) = 3.4e-46 Identities = 47/97 (48%), Positives = 54/97 (55%), Frame = +3 Query: 624 FIR*CKQGCLQFVNFETHLVVVTLYFYAKGKYKDGNFFQXIIIVS*QSLYFFLTQWXXKA 803 FIR CKQGCLQFV + LV VTL YAKGKYKDGNF + +Y A Sbjct: 79 FIRRCKQGCLQFVILKPILVAVTLVLYAKGKYKDGNFSPDQSYLYLTIIYTISYTVALYA 138 Query: 804 LLXFXLACKXICFNNSSXPKFXXXKIVVFPTYGXGVL 914 L+ F +ACK + + PKF K VVF TY GVL Sbjct: 139 LVLFYVACKDLLQPFNPVPKFVIIKSVVFLTYWQGVL 175 >gi|7485938|pir||T01201 hypothetical protein F21E10.13 - Arabidopsis thaliana >gi|3047085|gb|AAC13598.1| (AF058914) F21E10.13 gene product [Arabidopsis thaliana] Length = 396 Frame 3 hits (HSPs): _________________________ __________________________________________________ Database sequence: | | | | 396 0 150 300 Plus Strand HSPs: Score = 404 (142.2 bits), Expect = 9.3e-37, P = 9.3e-37 Identities = 83/145 (57%), Positives = 93/145 (64%), Frame = +3 Query: 480 WVIYNFLSLCLEWVGGPGSXILSLTGRVLKPSWFLXNCCLPPLALDGGFIR*CKQGCLQF 659 WVIYNFLSLCL WVGGPGS +LSL+GR LKPSW L CC PPL LDG FIR CKQGCLQF Sbjct: 55 WVIYNFLSLCLAWVGGPGSVVLSLSGRSLKPSWSLMTCCFPPLTLDGRFIRRCKQGCLQF 114 Query: 660 VNFETHLVVVTLYFYAKGKYKDGNFFQXIIIVS*QSLYFFLTQWXXKALLXFXLACKXIC 839 V + LV VTL YAKGKYKDGNF + +Y AL+ F +AC+ + Sbjct: 115 VILKPILVAVTLVLYAKGKYKDGNFNPDQAYLYLTIIYTISYTVALYALVLFYMACRDLL 174 Query: 840 FNNSSXPKFXXXKIVVFPTYGXGVL 914 + PKF K VVF TY GVL Sbjct: 175 QPFNPVPKFVIIKSVVFLTYWQGVL 199 Score = 232 (81.7 bits), Expect = 3.8e-18, P = 3.8e-18 Identities = 47/84 (55%), Positives = 63/84 (75%), Frame = +3 Query: 249 MVPIFLYIVAFICTCGAIALSLLHIYKHLLNYTEPTYQRFIVRIVFMVPV-YALMSFLSL 425 ++P +L IVAF+CT GAIAL++ HIY+HLLNYTEPTYQR+IVRI+FMVPV + + +FLSL Sbjct: 4 LIPFYLNIVAFLCTVGAIALAIFHIYRHLLNYTEPTYQRYIVRIIFMVPVTWVIYNFLSL 63 Query: 426 VLP----QGSIYFN-SIREIYEAW 482 L GS+ + S R + +W Sbjct: 64 CLAWVGGPGSVVLSLSGRSLKPSW 87 >gi|7110635 ref|NP_036396.1| chromosome 22 open reading frame 5 >gi|5596705|emb|CAB51403.1| (AL096879) hypothetical protein [Homo sapiens] Length = 373 Frame 3 hits (HSPs): ________________________________ __________________________________________________ Database sequence: | | | | 373 0 150 300 Plus Strand HSPs: Score = 335 (117.9 bits), Expect = 1.9e-29, P = 1.9e-29 Identities = 83/231 (35%), Positives = 120/231 (51%), Frame = +3 Query: 255 PIFLYIVAFICTCG-----AIALSLLHIYKHLLNYTEPTYQRFIVRIVFMVPVYALMSFL 419 P+FL A G A+ ++ IY HL Y+ P QR+IVRI+F+VP+YA S+L Sbjct: 4 PVFLMTTAAQAISGFFVWTALLITCHQIYMHLRCYSCPNEQRYIVRILFIVPIYAFDSWL 63 Query: 420 SLVL---PQGSIYFNSIREIYEAWVIYNFLSLCLEWVGGPGSXILSLTGRVLKPSWFLXN 590 SL+ Q +YF ++R+ YEA VIYNFLSLC E++GG S + + G+ ++ S Sbjct: 64 SLLFFTNDQYYVYFGTVRDCYEALVIYNFLSLCYEYLGGESSIMSEIRGKPIESSCMYGT 123 Query: 591 CCLPPLALDGGFIR*CKQGCLQFVNFETHLVVVTLYFYAKGKYKDGNFFQXIIIVS*QSL 770 CCL GF+R CKQ LQF + + V T+ A GKY+DG+F + + Sbjct: 124 CCLWGKTYSIGFLRFCKQATLQFCVVKPLMAVSTVVLQAFGKYRDGDFDVTSGYLYVTII 183 Query: 771 YFFLTQWXXKALLXFXLACKXICFNNSSXPKFXXXKIVVFPTYGXGVLFPL 923 Y AL F A + + S KF K V+F ++ G+L + Sbjct: 184 YNISVSLALYALFLFYFATRELLSPYSPVLKFFMVKSVIFLSFWQGMLLAI 234 >gi|6572254|emb|CAB63066.1| (AL020993) dJ5O6.2 (novel protein similar to C. elegans F40E10.6 (isoform 1)) [Homo sapiens] Length = 293 Frame 3 hits (HSPs): ________________________________________ __________________________________________________ Database sequence: | | | | | | | 293 0 50 100 150 200 250 Plus Strand HSPs: Score = 335 (117.9 bits), Expect = 1.9e-29, P = 1.9e-29 Identities = 83/231 (35%), Positives = 120/231 (51%), Frame = +3 Query: 255 PIFLYIVAFICTCG-----AIALSLLHIYKHLLNYTEPTYQRFIVRIVFMVPVYALMSFL 419 P+FL A G A+ ++ IY HL Y+ P QR+IVRI+F+VP+YA S+L Sbjct: 4 PVFLMTTAAQAISGFFVWTALLITCHQIYMHLRCYSCPNEQRYIVRILFIVPIYAFDSWL 63 Query: 420 SLVL---PQGSIYFNSIREIYEAWVIYNFLSLCLEWVGGPGSXILSLTGRVLKPSWFLXN 590 SL+ Q +YF ++R+ YEA VIYNFLSLC E++GG S + + G+ ++ S Sbjct: 64 SLLFFTNDQYYVYFGTVRDCYEALVIYNFLSLCYEYLGGESSIMSEIRGKPIESSCMYGT 123 Query: 591 CCLPPLALDGGFIR*CKQGCLQFVNFETHLVVVTLYFYAKGKYKDGNFFQXIIIVS*QSL 770 CCL GF+R CKQ LQF + + V T+ A GKY+DG+F + + Sbjct: 124 CCLWGKTYSIGFLRFCKQATLQFCVVKPLMAVSTVVLQAFGKYRDGDFDVTSGYLYVTII 183 Query: 771 YFFLTQWXXKALLXFXLACKXICFNNSSXPKFXXXKIVVFPTYGXGVLFPL 923 Y AL F A + + S KF K V+F ++ G+L + Sbjct: 184 YNISVSLALYALFLFYFATRELLSPYSPVLKFFMVKSVIFLSFWQGMLLAI 234 >gi|7503052|pir||T21697 hypothetical protein F40E10.6 - Caenorhabditis elegans >gi|3876602|emb|CAA93657.1| (Z69790) cDNA EST yk376g11.5 comes from this gene~cDNA EST yk442f1.5 comes from this gene~cDNA EST yk455h10.5 comes from this gene~cDNA EST yk457h6.5 comes from this gene~cDNA EST yk464d8.5 comes from this gene [Caenorhabditis elegans] >gi|3877015|emb|CAA93669.1| (Z69792) cDNA EST yk376g11.5 comes from this gene~cDNA EST yk442f1.5 comes from this gene~cDNA EST yk455h10.5 comes from this gene~cDNA EST yk457h6.5 comes from this gene~cDNA EST yk464d8.5 comes from this gene [Caenorhabditis elegans] Length = 595 Frame 3 hits (HSPs): ___________________ __________________________________________________ Database sequence: | | | | | 595 0 150 300 450 Plus Strand HSPs: Score = 340 (119.7 bits), Expect = 2.3e-29, P = 2.3e-29 Identities = 82/211 (38%), Positives = 120/211 (56%), Frame = +3 Query: 300 IALSLLHIYKHLLNYTEPTYQRFIVRIVFMVPVYALMSFLSLVLPQGS--IYFNSIREIY 473 I LS L IY+HL Y+ P QR+IVRI+F+VP+YA S+LSL+ + IYFNSIR+ Y Sbjct: 226 IQLSKLQIYQHLRFYSCPAEQRWIVRILFIVPIYAFDSWLSLIFFSDNVYIYFNSIRDCY 285 Query: 474 EAWVIYNFLSLCLEWVGGPGSXILSLTGRVLKPSWFLX-NCCLPPLALDGGFIR*CKQGC 650 EA+VIY+FLSLC E++GG + + + G+ ++P+ +L CCL F+R CKQ Sbjct: 286 EAFVIYSFLSLCYEYLGGESNIMAEIRGKPIRPTNYLTCTCCLAGKQYTIEFLRFCKQAT 345 Query: 651 LQFVNFETHLVVVTLYFYAKGKYKDGNFF--QXIIIVS*QSLYFFLTQWXXKALLXFXLA 824 LQF + + V+TL A GKY+DGN+ Q I ++ +Y + F A Sbjct: 346 LQFCFIKPIMAVITLMLTAIGKYEDGNWSLDQGYIYIT--LVYNVSISLALYGMFLFYAA 403 Query: 825 CKXICFNNSSXPKFXXXKIVVFPTYGXGVLFPL 923 + + KF K V+F ++ G L + Sbjct: 404 TRDLLSPYRPVLKFLTVKSVIFLSFWQGFLIAI 436 >gi|7292104|gb|AAF47516.1| (AE003472) CG12004 gene product [Drosophila melanogaster] Length = 348 Frame 3 hits (HSPs): ________________________________ __________________________________________________ Database sequence: | | | | 348 0 150 300 Plus Strand HSPs: Score = 331 (116.5 bits), Expect = 5.1e-29, P = 5.1e-29 Identities = 75/217 (34%), Positives = 115/217 (52%), Frame = +3 Query: 270 IVAFICTCGAIALSLLHIYKHLLNYTEPTYQRFIVRIVFMVPVYALMSFLSLVLPQGS-- 443 ++A +C A+ ++ IY+HL YT P QR+IVRI+F+VP+YA S++SL+ Sbjct: 24 VLAGVCVWAALFITCQQIYQHLRWYTNPQEQRWIVRILFIVPIYATYSWISLLFFNSDNV 83 Query: 444 -IYFNSIREIYEAWVIYNFLSLCLEWVGGPGSXILSLTGRVLKPSWFLXNCCLPPLALDG 620 IYF ++R+ YEA+VIYNFLSLC E++GG G+ + + G+ +K S CCL Sbjct: 84 YIYFFTVRDCYEAFVIYNFLSLCYEYLGGEGNIMSEIRGKPIKTSCLYGTCCLKGKTYTI 143 Query: 621 GFIR*CKQGCLQFVNFETHLVVVTLYFYAKGKYKDGNFFQXIIIVS*QSLYFFLTQWXXK 800 GF+R CKQ LQF + + + ++ A G Y DG++ + +Y Sbjct: 144 GFLRFCKQATLQFCLVKPLVAFIIIFLQAFGHYHDGDWSADGGYIYITIIYNISVSLALY 203 Query: 801 ALLXFXLACKXICFNNSSXPKFXXXKIVVFPTYGXGV 911 L F A + + KF K V+F ++ GV Sbjct: 204 GLYLFYFATRDLLTPFEPVLKFCTIKSVIFLSFWQGV 240 >gi|6572253|emb|CAB63065.1| (AL020993) dJ5O6.2 (novel protein similar to C. elegans F40E10.6 (isoform 2)) [Homo sapiens] Length = 261 Frame 3 hits (HSPs): _______________________________________ __________________________________________________ Database sequence: | | | | | | | 261 0 50 100 150 200 250 Plus Strand HSPs: Score = 317 (111.6 bits), Expect = 1.5e-27, P = 1.5e-27 Identities = 75/201 (37%), Positives = 108/201 (53%), Frame = +3 Query: 330 HLLNYTEPTYQRFIVRIVFMVPVYALMSFLSLVL---PQGSIYFNSIREIYEAWVIYNFL 500 HL Y+ P QR+IVRI+F+VP+YA S+LSL+ Q +YF ++R+ YEA VIYNFL Sbjct: 2 HLRCYSCPNEQRYIVRILFIVPIYAFDSWLSLLFFTNDQYYVYFGTVRDCYEALVIYNFL 61 Query: 501 SLCLEWVGGPGSXILSLTGRVLKPSWFLXNCCLPPLALDGGFIR*CKQGCLQFVNFETHL 680 SLC E++GG S + + G+ ++ S CCL GF+R CKQ LQF + + Sbjct: 62 SLCYEYLGGESSIMSEIRGKPIESSCMYGTCCLWGKTYSIGFLRFCKQATLQFCVVKPLM 121 Query: 681 VVVTLYFYAKGKYKDGNFFQXIIIVS*QSLYFFLTQWXXKALLXFXLACKXICFNNSSXP 860 V T+ A GKY+DG+F + +Y AL F A + + S Sbjct: 122 AVSTVVLQAFGKYRDGDFDVTSGYLYVTIIYNISVSLALYALFLFYFATRELLSPYSPVL 181 Query: 861 KFXXXKIVVFPTYGXGVLFPL 923 KF K V+F ++ G+L + Sbjct: 182 KFFMVKSVIFLSFWQGMLLAI 202 >gi|7297586|gb|AAF52840.1| (AE003626) CG5850 gene product [Drosophila melanogaster] Length = 569 Frame 3 hits (HSPs): ____________________ __________________________________________________ Database sequence: | | | | | 569 0 150 300 450 Plus Strand HSPs: Score = 275 (96.8 bits), Expect = 2.6e-22, P = 2.6e-22 Identities = 69/217 (31%), Positives = 117/217 (53%), Frame = +3 Query: 264 LYIVAFICTCGAIALSLLHIYKHLLNYTEPTYQRFIVRIVFMVPVYALMSFLSLVLPQGS 443 L ++ + A+ +S+ HI +H++++T+P Q+ I+RI++MVP+YAL +++ L P+ S Sbjct: 51 LILIGGLFVLSAVPVSIWHIIQHVIHFTKPILQKHIIRILWMVPIYALNAWIGLFFPKHS 110 Query: 444 IYFNSIREIYEAWVIYNFLSLCLEWVGGPGSXILSLTGRVLKPSWFLXNCCLPPLALDGG 623 IY +S+RE YEA+VIYNF+ L ++ ++ + P +F CC+ P + Sbjct: 111 IYVDSLRECYEAYVIYNFMVYLLNYLNLGMDLEATMEYKPQVPHFFPL-CCMRPWVMGRE 169 Query: 624 FIR*CKQGCLQFVNFETHLVVVTLYFYAKGKYKDGNF-----FQXIIIVS*QSLYFFLTQ 788 FI CK G LQ+ +++ G Y +G F F I++V+ ++ F+ Sbjct: 170 FIHNCKHGILQYTVVRPITTFISVICELCGVYGEGEFAGNVAFPYIVVVN--NISQFVAM 227 Query: 789 WXXKALLXFXLACKXICFNNSSXPKFXXXKIVVFPTYGXGVL 914 + L+ F A K PKF K VVF ++ GVL Sbjct: 228 Y---CLVLFYRANKEDLKPMKPIPKFLCIKAVVFFSFFQGVL 266 >gi|7023136|dbj|BAA91851.1| (AK001708) unnamed protein product [Homo sapiens] Length = 438 Frame 3 hits (HSPs): __________________________ __________________________________________________ Database sequence: | | | | 438 0 150 300 Plus Strand HSPs: Score = 242 (85.2 bits), Expect = 4.5e-19, P = 4.5e-19 Identities = 67/219 (30%), Positives = 110/219 (50%), Frame = +3 Query: 267 YIVAFICTCGAIALSLLHIYKHLLNYTEPTYQRFIVRIVFMVPVYALMSFLSLVLPQGSI 446 + +A I I +SL I +HL++YT+P Q+ I+RI++MVP+Y+L S+++L P +I Sbjct: 49 WFIAGIFLLLTIPISLWVILQHLVHYTQPELQKPIIRILWMVPIYSLDSWIALKYPGIAI 108 Query: 447 YFNSIREIYEAWVIYNFLSLCLEWVGGPGSXILSLTGRVLKPSWFLXNCCLPPLALDGGF 626 Y ++ RE YEA+VIYNF+ ++ ++ + + F CC PP A+ Sbjct: 109 YVDTCRECYEAYVIYNFMGFLTNYLTNRYPNLVLILEAKDQQKHFPPLCCCPPWAMGEVL 168 Query: 627 IR*CKQGCLQFVNFETHLVVVTLYFYAKGKYKDGNF-----FQXIIIVS*QSLYFFLTQW 791 + CK G LQ+ +V L G Y +GNF + ++I++ S F + Sbjct: 169 LFRCKLGVLQYTVVRPFTTIVALICELLGIYDEGNFSFSNAWTYLVIINNMSQLFAMY-- 226 Query: 792 XXKALLXFXLACKXICFNNSSXPKFXXXKIVVFPTYGXGVLFPL 923 LL F K KF ++VVF ++ V+ L Sbjct: 227 ---CLLLFYKVLKEELSPIQPVGKFLCVRLVVFVSFWQAVVIAL 267 >gi|1351659|sp|Q09906|YAJ6_SCHPO HYPOTHETICAL 49.3 KD PROTEIN C30D11.06C IN CHROMOSOME I >gi|2130405|pir||S62564 hypothetical protein SPAC30D11.06c - fission yeast (Schizosaccharomyces pombe) >gi|7491068|pir||T38593 hypothetical protein SPAC30D11.06c - fission yeast (Schizosaccharomyces pombe) >gi|1065893|emb|CAA91892.1| (Z67961) hypothetical protein [Schizosaccharomyces pombe] Length = 426 Frame 3 hits (HSPs): _________________________ Annotated Domains: __________________________________________________ __________________________________________________ Database sequence: | | | | 426 0 150 300 __________________ Annotated Domains: DOMO DM06417: 1..340 DOMO DM08523: 341..381 Entrez Transmembrane region: POTENTIAL. 39..59 Entrez Transmembrane region: POTENTIAL. 73..93 Entrez Transmembrane region: POTENTIAL. 133..153 Entrez Transmembrane region: POTENTIAL. 172..192 Entrez Transmembrane region: POTENTIAL. 223..243 PRODOM PD014035: 4..272 PRODOM PD128189: YAJ6_SCHPO 274..425 __________________ Plus Strand HSPs: Score = 217 (76.4 bits), Expect = 2.4e-16, P = 2.4e-16 Identities = 62/203 (30%), Positives = 101/203 (49%), Frame = +3 Query: 297 AIALSLLHIYKHLLNYTEPTYQRFIVRIVFMVPVYALMSFLSLVLPQ-GSIYFNSIREIY 473 A+ LS + I HL NY +P QR +VRI+ M+ +Y+ +SFLS+ + GSI F REIY Sbjct: 16 ALVLSCISIITHLKNYKKPVLQRSVVRILMMIVIYSSVSFLSVYNEKIGSI-FEPFREIY 74 Query: 474 EAWVIYNFLSLCLEWVGGPGSXILSLTGRVLKPSWFLXNCCLPPLALDGGF-IR*CKQGC 650 EA+ +Y F L ++++GG + ++SL G + +P + N + L + K+G Sbjct: 75 EAFALYCFFCLLIDYLGGERAAVISLHGHLPRPRLWPLNYLQDDIDLSDPYTFLSIKRGI 134 Query: 651 LQFVNFETHLVVVTLYFYAKGKYKDGNFFQXIIIVS*QSLYFFLTQWXXKALLXFXLACK 830 LQ+ + LV+ L G Y + Q + + L+ L L + L Sbjct: 135 LQYTWLKPFLVIAVLLTKVTGVYDRED--QPVYASA--DLWIGLVYNISITLSLYSLTTF 190 Query: 831 XICFNNSSXP-----KFXXXKIVVFPTY 899 +C + P KF K ++F +Y Sbjct: 191 WVCLHEELAPFRPFPKFLSVKAIIFASY 218 >gi|6322904 ref|NP_012977.1| Ykr051wp >gi|549619|sp|P36142|YK31_YEAST HYPOTHETICAL 48.8 KD PROTEIN IN TRK2-MRS4 INTERGENIC REGION >gi|539262|pir||S38125 hypothetical protein YKR051w - yeast (Saccharomyces cerevisiae) >gi|486505|emb|CAA82129.1| (Z28276) ORF YKR051w [Saccharomyces cerevisiae] Length = 418 Frame 3 hits (HSPs): ___________________________ __________________________________________________ Database sequence: | | | | 418 0 150 300 Plus Strand HSPs: Score = 168 (59.1 bits), Expect = 3.6e-09, P = 3.6e-09 Identities = 62/223 (27%), Positives = 100/223 (44%), Frame = +3 Query: 246 KMVPIFLYIVAFICTCGAIALSLLHIYKHLLNYTEPTYQRFIVRIVFMVPVYALMSFLSL 425 K++ +LY + A +S I +HLLNY +P QR +RI+ +VP++++ + Sbjct: 4 KLLCWWLYWPCVYSSIIATIISFYTITRHLLNYRKPYEQRLSIRILLLVPIFSVSCASGI 63 Query: 426 VLPQGS-IYFNSIREIYEAWVIYNFLSLCLEWVGGPGSXI--LSLTGRVLK-PSWFLXNC 593 + P+ + Y + IRE YEA+VIY F + +GG + I LSL + P + Sbjct: 64 IKPEAAQFYVDPIREFYEAFVIYTFFTFLTLLLGGERNIITVLSLNHAPTRHPIPLIGKI 123 Query: 594 CLPPLALDGGF-IR*CKQGCLQFVNFETHLVVVTLYFYAKGKYKDGNFFQXIIIVS*QSL 770 C P+ L F K+G LQ+V F+ TL A K F+ + V Sbjct: 124 C-KPIDLSDPFDFLFVKKGILQYVWFKPFYCFGTLICSAWKLPK----FEIFLNV----F 174 Query: 771 YFFLTQWXXKALLXFXLACKXICFNNSSXPKFXXXKIVVFPTYGXGVL 914 Y W +L F KF K+++F +Y ++ Sbjct: 175 YNISVTWSLYSLALFWKCLYPELTPYKPWLKFLCVKLIIFASYWQSII 222 >gi|7487004|pir||T00451 hypothetical protein T14N5.8 - Arabidopsis thaliana >gi|3540198|gb|AAC34348.1| (AC004260) Unknown protein [Arabidopsis thaliana] Length = 500 Frame 3 hits (HSPs): ________ _____ __________________________________________________ Database sequence: | | | | | 500 0 150 300 450 Plus Strand HSPs: Score = 153 (53.9 bits), Expect = 2.1e-07, Sum P(2) = 2.1e-07 Identities = 32/76 (42%), Positives = 49/76 (64%), Frame = +3 Query: 297 AIALSLLHIYKHLLNYTEPTYQRFIVRIVFMVPVYALMSFLSLVLPQGSIYFNSIREIYE 476 AI L + I++HL +Y +P Q+F++ ++ MVPVYA+ SFLSLV + + IR+ YE Sbjct: 53 AILLPMYLIFEHLASYNQPEEQKFLIGLILMVPVYAVESFLSLVNSEAAFNCEVIRDCYE 112 Query: 477 AWVIYNF---LSLCLE 515 A+ +Y F L CL+ Sbjct: 113 AFALYCFERYLIACLD 128 Score = 40 (14.1 bits), Expect = 2.1e-07, Sum P(2) = 2.1e-07 Identities = 11/42 (26%), Positives = 16/42 (38%), Frame = +3 Query: 789 WXXKALLXFXLACKXICFNNSSXPKFXXXKIVVFPTYGXGVL 914 W L+ F K KF K +VF T+ G++ Sbjct: 249 WALYCLVQFYNVIKDKLAPIKPLAKFLTFKSIVFLTWWQGII 290 >gi|7485716|pir||T05165 hypothetical protein F18E5.190 - Arabidopsis thaliana >gi|3080401|emb|CAA18721.1| (AL022603) putative protein [Arabidopsis thaliana] >gi|4455265|emb|CAB36801.1| (AL035527) putative protein [Arabidopsis thaliana] >gi|7268954|emb|CAB81264.1| (AL161555) putative protein [Arabidopsis thaliana] Length = 294 Frame 3 hits (HSPs): ______________________________________ __________________________________________________ Database sequence: | | | | | | | 294 0 50 100 150 200 250 Plus Strand HSPs: Score = 140 (49.3 bits), Expect = 2.9e-06, P = 2.9e-06 Identities = 59/218 (27%), Positives = 102/218 (46%), Frame = +3 Query: 273 VAFICTCGAIALSLLH-----IYKHLLNYTEPTYQRFIVRIVFMVPVYALMSFLSLVLPQ 437 + F C+ ++ L+L H + +HL ++ P Q+ I+ IV M P+YA++SF+ L+ + Sbjct: 12 ITFYCSAFSVLLTL-HFTIQLVSQHLFHWKNPKEQKAILIIVLMAPIYAVVSFIGLLEVK 70 Query: 438 GS----IYFNSIREIYEAWVIYNFLSLCLEWVG-GPGSXIL--SLTGRVLKPSWFLXNCC 596 GS ++ SI+E YEA VI FL+L ++ IL + GR + S F Sbjct: 71 GSETFFLFLESIKECYEALVIAKFLALMYSYLNISMSKNILPDGIKGREIHHS-FPMTLF 129 Query: 597 LPPLA-LDGGFIR*CKQGCLQFVNFETHLVVVTLYFYAKGKYKDGNFFQXIIIVS*QSLY 773 P + LD ++ K QFV + + G Y + IIV+ Sbjct: 130 QPHVVRLDRHTLKLLKYWTWQFVVIRPVCSTLMIALQLIGFYPSWLSWTFTIIVN----- 184 Query: 774 FFLTQWXXKALLXFXLACKXICFNNSSXPKFXXXKIVVFPTYGXGV 911 F ++ ++ + + K + +N KF K +VF + G+ Sbjct: 185 FSVSLALYSLVIFYHVFAKELAPHNPLA-KFLCIKGIVFFVFWQGI 229 >gi|7496248|pir||T15541 hypothetical protein C18A3.4 - Caenorhabditis elegans >gi|861347|gb|AAA68368.1| (U28944) C18A3.4 gene product [Caenorhabditis elegans] Length = 342 Frame 3 hits (HSPs): _______________________ __________________________________________________ Database sequence: | | | | 342 0 150 300 Plus Strand HSPs: Score = 106 (37.3 bits), Expect = 0.032, P = 0.031 Identities = 36/150 (24%), Positives = 70/150 (46%), Frame = +3 Query: 273 VAFICTCGAIALSLLH-IYKHLLNYTEPTYQRFIVRIVFMVPVYALMSFLSLVLPQGSIY 449 VA T G + L++LH IY H T + + IV + P+ +L++ +++ +P+ Sbjct: 48 VATAVTVGTVCLAVLHLIYIHFY-ITHSSRRLHIVLLACTAPLVSLLALVAMYMPRVWFL 106 Query: 450 FNSIREIYEAWVIYNFLSLCLEWVGGPGSXILSLTGR-----VLKPSWFLXNCCLPPLAL 614 + + +Y ++ ++ + L L G + + + R + P + CLP + L Sbjct: 107 SHLLSFLYFSFALWVIICLLLHIFDGHHALVTKMMQRLQYVEIATPPFCCLFPCLPKVRL 166 Query: 615 DGGFIR*CKQGCLQ--FVNFETHLVVVTLYF 701 +G IR C+ +Q V LV + +YF Sbjct: 167 EGKKIRWCELMVMQAPIVRLFATLVSLVIYF 197 >gi|7485951|pir||T05664 hypothetical protein F22I13.130 - Arabidopsis thaliana >gi|4539344|emb|CAB37492.1| (AL035539) putative protein [Arabidopsis thaliana] >gi|7270820|emb|CAB80501.1| (AL161593) putative protein [Arabidopsis thaliana] Length = 466 Frame 3 hits (HSPs): _______________________ __________________________________________________ Database sequence: | | | | | 466 0 150 300 450 Plus Strand HSPs: Score = 89 (31.3 bits), Expect = 1.8, Sum P(2) = 0.83 Identities = 50/199 (25%), Positives = 74/199 (37%), Frame = +3 Query: 378 IVF-MVPVYALMSFLSLVLPQGSIYFNSIREIYEAWVIYNFLSLCLEWVGGPGSXI--LS 548 +VF + Y F SLV P S+ +R+ YE++ +Y F + +GG I + Sbjct: 38 LVFDHLSTYKNPEFASLVKPSISVDCGILRDCYESFAMYCFGRYLVACIGGEERTIEFME 97 Query: 549 LTGRV-LKPSW-------------FLXNCCLPPLALDGGFIR*CKQGCLQFVNFETHLVV 686 GR K F N L P L F + K G +Q++ ++ + Sbjct: 98 RQGRKSFKTPLLDHKDEKGIIKHPFPMNLFLKPWRLSPWFYQVVKFGIVQYMIIKSLTAL 157 Query: 687 VTLYFYAKGKYKDGNFFQXIIIVS*QSLYFFLTQWXXKALLXFXLACKXICFNNSSXPKF 866 L A G Y +G F + F W L+ F A K + KF Sbjct: 158 TALILEAFGVYCEGEFKWGCGYPYLAVVLNFSQSWALYCLVQFYGATKDELAHIQPLAKF 217 Query: 867 XXXKIVVFPTYGXGVLFPL 923 K +VF T+ GV L Sbjct: 218 LTFKSIVFLTWWQGVAIAL 236 Score = 43 (15.1 bits), Expect = 1.8, Sum P(2) = 0.83 Identities = 7/24 (29%), Positives = 13/24 (54%), Frame = +3 Query: 300 IALSLLHIYKHLLNYTEPTYQRFI 371 ++LSL ++ HL Y P + + Sbjct: 32 LSLSLFLVFDHLSTYKNPEFASLV 55 >gi|7020081|dbj|BAA90988.1| (AK000169) unnamed protein product [Homo sapiens] Length = 313 Frame 3 hits (HSPs): ___________________ __________________________________________________ Database sequence: | | | | | | | | 313 0 50 100 150 200 250 300 Plus Strand HSPs: Score = 89 (31.3 bits), Expect = 2.2, P = 0.89 Identities = 32/111 (28%), Positives = 47/111 (42%), Frame = +3 Query: 591 CCLPPLALDGGFIR*CKQGCLQFVNFETHLVVVTLYFYAKGKYKDGNF-----FQXIIIV 755 CC PP A+ + CK G LQ+ +V L G Y +GNF + ++I+ Sbjct: 32 CCCPPWAMGEVLLFRCKLGVLQYTVVRPFTTIVALICELLGIYDEGNFSFSNAWTYLVII 91 Query: 756 S*QSLYFFLTQWXXKALLXFXLACKXICFNNSSXPKFXXXKIVVFPTYGXGVLFPL 923 + S F + LL F K KF K+VVF ++ V+ L Sbjct: 92 NNMSQLFAMY-----CLLPFYKVLKEELSPIQPVGKFLCVKLVVFVSFWQAVVIAL 142 >gi|7707666|dbj|BAA95343.1| (AB027560) ATPase subunit 6 [Echinococcus vogeli] Length = 170 Frame 3 hits (HSPs): _________________________________________ __________________________________________________ Database sequence: | | | | | 170 0 50 100 150 Plus Strand HSPs: Score = 83 (29.2 bits), Expect = 3.0, P = 0.95 Identities = 42/147 (28%), Positives = 69/147 (46%), Frame = +3 Query: 264 LYIVAFICTCGAIALSLLHIYKHLLNYTEPTYQRFIVRIVFMVPVYALMSFLSLVLPQGS 443 +Y++ F C + L+ + L Y P F + VF+V V +M F+SL L + Sbjct: 14 IYVLVFNCVSYYYFVLLVFVLMWFLIYRLPYCYSFYLFSVFLVGVVFVM-FVSLFLCR-- 70 Query: 444 IYFNSIREIYEAWV-IYNFLSLC-LEWVGGPGSXILSLTGRVLKPSWFLXNCCLPPLALD 617 F+S+ + ++V + L +C L V S I+ +L+P + CL +AL Sbjct: 71 -VFSSVSSFFASFVPLGTPLYICFLVCVAETISYIIRPVVLILRPFINISLGCLGAVAL- 128 Query: 618 GGFIR*CKQGCLQFVNFETHLVVVTLYFY 704 G L FV++ LV+V L+FY Sbjct: 129 ---------GNLCFVSWWWSLVLVGLFFY 148 >gi|625591|pir||D61399 hypothetical early protein E8 - bovine papillomavirus type 3 Length = 75 Frame 3 hits (HSPs): _________________________ Annotated Domains: ____________________________ __________________________________________________ Database sequence: | | | | | 75 0 20 40 60 __________________ Annotated Domains: DOMO DM04650: 34..75 __________________ Plus Strand HSPs: Score = 66 (23.2 bits), Expect = 3.8, P = 0.98 Identities = 19/37 (51%), Positives = 22/37 (59%), Frame = +3 Query: 411 SFLSLVLPQGSIYFNSIREIYEA---WVIYNFLSLCL 512 S LSL P GSI S+ IY WV ++FLSLCL Sbjct: 20 SSLSLHGPLGSICIMSLTLIYWLLLLWVSFHFLSLCL 56 Parameters: filter=none matrix=BLOSUM62 V=50 B=50 E=10 gi H=1 sort_by_pvalue echofilter ctxfactor=5.97 Query ----- As Used ----- ----- Computed ---- Frame MatID Matrix name Lambda K H Lambda K H Std. 0 BLOSUM62 0.318 0.135 0.401 +3 0 BLOSUM62 0.318 0.135 0.401 0.340 0.153 0.510 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +2 0 BLOSUM62 0.318 0.135 0.401 0.347 0.152 0.573 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a +1 0 BLOSUM62 0.318 0.135 0.401 0.346 0.152 0.527 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -1 0 BLOSUM62 0.318 0.135 0.401 0.352 0.154 0.565 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -2 0 BLOSUM62 0.318 0.135 0.401 0.361 0.165 0.583 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a -3 0 BLOSUM62 0.318 0.135 0.401 0.353 0.157 0.550 Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a Query Frame MatID Length Eff.Length E S W T X E2 S2 +3 0 309 295 10. 78 3 12 22 0.11 36 33 0.10 40 +2 0 309 294 10. 78 3 12 22 0.10 36 33 0.10 40 +1 0 310 296 10. 78 3 12 22 0.11 36 33 0.10 40 -1 0 310 297 10. 78 3 12 22 0.11 36 33 0.10 40 -2 0 309 296 10. 78 3 12 22 0.11 36 33 0.10 40 -3 0 309 294 10. 78 3 12 22 0.10 36 33 0.10 40 Statistics: Database: /usr/local/dot5/sl_home/beauty/seqdb/blast/nr Title: nr Release date: unknown Posted date: 8:50 PM CDT May 27, 2000 Format: BLAST # of letters in database: 158,518,215 # of sequences in database: 505,245 # of database sequences satisfying E: 19 No. of states in DFA: 600 (59 KB) Total size of DFA: 295 KB (320 KB) Time to generate neighborhood: 0.02u 0.01s 0.03t Elapsed: 00:00:00 No. of threads or processors used: 4 Search cpu time: 384.93u 1.46s 386.39t Elapsed: 00:02:48 Total cpu time: 385.02u 1.49s 386.51t Elapsed: 00:02:48 Start: Mon Oct 16 19:22:23 2000 End: Mon Oct 16 19:25:11 2000
Annotated Domains Database: March 14, 2000
Release Date: March 14, 2000