BLASTX 2.2.6 [Apr-09-2003]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= 3802485.2.1
(1118 letters)
Database: nr
3,454,138 sequences; 1,185,965,366 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_909102.1| unknown protein [Oryza sativa (japonica cu... 557 e-157
ref|NP_177234.1| unknown protein [Arabidopsis thaliana] >gi... 383 e-105
ref|NP_173730.1| unknown protein [Arabidopsis thaliana] 329 1e-88
ref|NP_566403.1| unknown protein [Arabidopsis thaliana] >gi... 251 3e-65
gb|AAF23204.1| unknown protein [Arabidopsis thaliana] 251 3e-65
gb|AAC00602.1| Unknown protein [Arabidopsis thaliana] 245 2e-63
gb|AAU44392.1| hypothetical protein AT1G23170 [Arabidopsis ... 238 3e-61
gb|AAH90099.1| LOC548392 protein [Xenopus tropicalis] 49 3e-04
ref|XP_700034.1| PREDICTED: hypothetical protein XP_694942 ... 44 0.012
emb|CAG12945.1| unnamed protein product [Tetraodon nigrovir... 44 0.016
gb|AAH54691.1| LOC402840 protein [Danio rerio] 43 0.020
ref|XP_796011.1| PREDICTED: similar to CG33129-PE, isoform ... 40 0.17
ref|XP_426216.1| PREDICTED: similar to FLJ20254 protein [Ga... 40 0.17
ref|XP_514113.1| PREDICTED: similar to ubiquitin specific p... 38 0.85
ref|XP_474995.1| OSJNBa0065B15.15 [Oryza sativa (japonica c... 37 1.9
gb|EAA14245.3| ENSANGP00000015679 [Anopheles gambiae str. P... 36 3.2
ref|XP_750113.1| GPI anchored protein [Aspergillus fumigatu... 35 7.2
>ref|NP_909102.1| unknown protein [Oryza sativa (japonica cultivar-group)]
dbj|BAB03379.1| unknown protein [Oryza sativa (japonica cultivar-group)]
Length = 586
Score = 557 bits (1435), Expect = e-157
Identities = 276/333 (82%), Positives = 298/333 (89%)
Frame = +2
Query: 5 ISSSYENQQDIQLMRFADYFGRAFVAVSASQFAWAKMFKESTVSKMVDIPLCHIPEAVIK 184
IS SYENQQDIQLMRFADYFGR+F +VSA+QF WAKMFKES VSKMVDIPLCHIPE V
Sbjct: 154 ISESYENQQDIQLMRFADYFGRSFASVSAAQFPWAKMFKESLVSKMVDIPLCHIPEPVRN 213
Query: 185 TASDWISQRSSDALGDFVLWCIDSIMSELSGPSAGPKGSKKVAQQSPRAQVAIFVVLAMT 364
TASDWI+QRS DALGDFV+WCIDSIMSELSG + G KGSKK AQQ+PRAQVAIFVVLA+T
Sbjct: 214 TASDWINQRSPDALGDFVMWCIDSIMSELSGQAVGAKGSKKAAQQTPRAQVAIFVVLALT 273
Query: 365 LRRKPDVLVNVMPKIMGNNKYLGQEKLPIIVWVIAQASQGDLVSGMFCWAHSLFPTLCAK 544
+RRKP+VL NV+PKIMGNNKYLGQEKLPIIVWVIAQASQGDLV+GMFCWAH LFPTLCAK
Sbjct: 274 VRRKPEVLTNVLPKIMGNNKYLGQEKLPIIVWVIAQASQGDLVTGMFCWAHFLFPTLCAK 333
Query: 545 SSGNPLARDLVLQLLERILSVTKARSILLNGAVRKGERLVPPVSFDLFMRATFPVSNARV 724
SGNP RDLVLQLLERILS KAR ILLNGAVRKGERL+PPV+FDLFMRA FPVS+ARV
Sbjct: 334 PSGNPQTRDLVLQLLERILSAPKARGILLNGAVRKGERLIPPVTFDLFMRAAFPVSSARV 393
Query: 725 KATERFEAAYPMIKELALAGPPGSKTVKQASQQLLPLCAKAMQENNAELTREAVDVFIWC 904
KATERFEAAYP IKELALAGPPGSKTVKQA+QQLLPLC KAMQENNA+LT E+ VFIWC
Sbjct: 394 KATERFEAAYPTIKELALAGPPGSKTVKQAAQQLLPLCVKAMQENNADLTGESAGVFIWC 453
Query: 905 LTQNAESYKXXERIYLENIEAXXAVLSKVVIDW 1003
LTQNAESYK ER++ EN+EA VLS +V W
Sbjct: 454 LTQNAESYKLWERLHPENVEASVVVLSTIVTKW 486
>ref|NP_177234.1| unknown protein [Arabidopsis thaliana]
gb|AAP37737.1| At1g70770 [Arabidopsis thaliana]
gb|AAM97096.1| unknown protein [Arabidopsis thaliana]
gb|AAG52333.1| unknown protein; 13405-15968 [Arabidopsis thaliana]
gb|AAD55492.1| Unknown protein [Arabidopsis thaliana]
Length = 610
Score = 383 bits (983), Expect = e-105
Identities = 188/334 (56%), Positives = 246/334 (73%), Gaps = 1/334 (0%)
Frame = +2
Query: 8 SSSYENQQDIQLMRFADYFGRAFVAVSASQFAWAKMFKESTVSKMVDIPLCHIPEAVIKT 187
S SY +Q +IQLMRFADYFGRA VS+ QF W KMFKES +SK++++PL HIPE V KT
Sbjct: 159 SESYASQPEIQLMRFADYFGRALSGVSSVQFPWVKMFKESPLSKLIEVPLAHIPEPVYKT 218
Query: 188 ASDWISQRSSDALGDFVLWCIDSIMSELSGPSAGPKGSKKVAQQSP-RAQVAIFVVLAMT 364
+ DWI+ R +ALG FVLW D I+++L+ G KG KK QQ+ ++QVAIFV LAM
Sbjct: 219 SVDWINHRPIEALGAFVLWAFDCILTDLAAQQGGAKGGKKGGQQTTSKSQVAIFVALAMV 278
Query: 365 LRRKPDVLVNVMPKIMGNNKYLGQEKLPIIVWVIAQASQGDLVSGMFCWAHSLFPTLCAK 544
LRRKPD L NV+P + N KY GQ+KLP+ VW++AQASQGD+ G++ WAH+L P + K
Sbjct: 279 LRRKPDALTNVLPTLRENPKYQGQDKLPVTVWMMAQASQGDIAVGLYSWAHNLLPVVGNK 338
Query: 545 SSGNPLARDLVLQLLERILSVTKARSILLNGAVRKGERLVPPVSFDLFMRATFPVSNARV 724
+ NP +RDL+LQL+E+IL+ KAR+IL+NGAVRKGERL+PP SF++ +R TFP S+ARV
Sbjct: 339 NC-NPQSRDLILQLVEKILTNPKARTILVNGAVRKGERLIPPPSFEILLRLTFPASSARV 397
Query: 725 KATERFEAAYPMIKELALAGPPGSKTVKQASQQLLPLCAKAMQENNAELTREAVDVFIWC 904
KATERFEA YP++KE+ALAG PGSK +KQ +QQ+ K E N L +EA + IW
Sbjct: 398 KATERFEAIYPLLKEVALAGAPGSKAMKQVTQQIFTFALKLAGEGNPVLAKEATAIAIWS 457
Query: 905 LTQNAESYKXXERIYLENIEAXXAVLSKVVIDWR 1006
+TQN + K + +Y EN+EA AVL K+V +W+
Sbjct: 458 VTQNFDCCKHWDNLYKENLEASVAVLKKLVEEWK 491
>ref|NP_173730.1| unknown protein [Arabidopsis thaliana]
Length = 569
Score = 329 bits (844), Expect = 1e-88
Identities = 162/287 (56%), Positives = 216/287 (75%), Gaps = 1/287 (0%)
Frame = +2
Query: 149 IPLCHIPEAVIKTASDWISQRSSDALGDFVLWCIDSIMSELSGPSAGPKGSKKVAQQ-SP 325
IPL HIPEAV KT++DWI+QR +ALG FVLW +D I+++L+ G KG KK AQQ S
Sbjct: 170 IPLSHIPEAVYKTSADWINQRPIEALGAFVLWGLDCILADLAVQQGGVKGGKKGAQQASS 229
Query: 326 RAQVAIFVVLAMTLRRKPDVLVNVMPKIMGNNKYLGQEKLPIIVWVIAQASQGDLVSGMF 505
++QVAIFV +AM LR+KPD L N++P + N KY GQ+KLP+ VW++AQASQGD+ G++
Sbjct: 230 KSQVAIFVAVAMVLRKKPDALTNILPTLRENPKYQGQDKLPVTVWMMAQASQGDISVGLY 289
Query: 506 CWAHSLFPTLCAKSSGNPLARDLVLQLLERILSVTKARSILLNGAVRKGERLVPPVSFDL 685
WAH+L P + +KS NP +RDL+LQL+ERILS KAR+IL+NGAVRKGERL+PP SF++
Sbjct: 290 SWAHNLLPVVSSKSC-NPQSRDLILQLVERILSNPKARTILVNGAVRKGERLIPPPSFEI 348
Query: 686 FMRATFPVSNARVKATERFEAAYPMIKELALAGPPGSKTVKQASQQLLPLCAKAMQENNA 865
+R TFP S+ARVKATERFEA YP++KE++LAG PGSK +KQ +QQ+ KA E N
Sbjct: 349 LVRLTFPASSARVKATERFEAIYPLLKEVSLAGAPGSKAMKQVTQQIFTFALKAAGEENP 408
Query: 866 ELTREAVDVFIWCLTQNAESYKXXERIYLENIEAXXAVLSKVVIDWR 1006
L +EA + IW LTQN + K E +Y +N++A AVL K++ +W+
Sbjct: 409 LLAKEAAAITIWALTQNVDCCKHWENLYTDNLKASVAVLKKLIGEWK 455
>ref|NP_566403.1| unknown protein [Arabidopsis thaliana]
gb|AAL16283.1| AT3g11880/F26K24_17 [Arabidopsis thaliana]
Length = 443
Score = 251 bits (642), Expect = 3e-65
Identities = 143/338 (42%), Positives = 208/338 (61%), Gaps = 8/338 (2%)
Frame = +2
Query: 5 ISSSYENQQDIQLMRFADYFGRAFVAVSASQFAWAKMFKESTVSK---MVDIPLCHIPEA 175
IS S+ + QL++F DY + +S+ Q+ W MFK S K M+D+PL HIP +
Sbjct: 27 ISKSHAFVPEEQLLKFVDYLE---IKLSSVQYLWLDMFKGSPCPKLIDMIDVPLSHIPVS 83
Query: 176 VIKTASDWISQRSSDALGDFVLWCIDSIMSELSGPSAGPKGSKKVAQQSPRAQVAIFVVL 355
V T+ +W+ + S L FV+W ++ +++ L P G G ++ + + + VA+FV L
Sbjct: 84 VYDTSVEWLDKFSIGLLCAFVVWSLNRLLTILEPPQQG--GHQR--RTTSKFHVAVFVAL 139
Query: 356 AMTLRRKPDVLVNVMPKIMGNNKYLGQEKLPIIVWVIAQASQGDLVSGMFCWAHSLFPT- 532
AM LR +P+ LV V+P + ++Y G +KLPI+VW++AQASQGDL G++ W+ +L P
Sbjct: 140 AMVLRNEPNTLVIVLPTLK-EDEYQGHDKLPILVWMMAQASQGDLSVGLYSWSCNLLPVF 198
Query: 533 ----LCAKSSGNPLARDLVLQLLERILSVTKARSILLNGAVRKGERLVPPVSFDLFMRAT 700
L S N + DL+LQL E ILS AR+IL+NG V +RL+ P +F+L MR T
Sbjct: 199 YQENLLPVSRSNSQSMDLILQLAEMILSNLDARTILVNGTVIDKQRLISPYAFELLMRLT 258
Query: 701 FPVSNARVKATERFEAAYPMIKELALAGPPGSKTVKQASQQLLPLCAKAMQENNAELTRE 880
FP S+ RVKATERFEA YP++KE+ALA PGS+ +KQ +QQ+ N L +E
Sbjct: 259 FPASSERVKATERFEAIYPLLKEVALACEPGSELMKQVTQQIFHYSLIIAGRRNLVLAKE 318
Query: 881 AVDVFIWCLTQNAESYKXXERIYLENIEAXXAVLSKVV 994
A + +W LT+N + K E++Y EN EA AVL K+V
Sbjct: 319 ATAIAVWSLTENVDCCKQWEKLYWENKEASVAVLKKLV 356
>gb|AAF23204.1| unknown protein [Arabidopsis thaliana]
Length = 459
Score = 251 bits (642), Expect = 3e-65
Identities = 143/338 (42%), Positives = 208/338 (61%), Gaps = 8/338 (2%)
Frame = +2
Query: 5 ISSSYENQQDIQLMRFADYFGRAFVAVSASQFAWAKMFKESTVSK---MVDIPLCHIPEA 175
IS S+ + QL++F DY + +S+ Q+ W MFK S K M+D+PL HIP +
Sbjct: 43 ISKSHAFVPEEQLLKFVDYLE---IKLSSVQYLWLDMFKGSPCPKLIDMIDVPLSHIPVS 99
Query: 176 VIKTASDWISQRSSDALGDFVLWCIDSIMSELSGPSAGPKGSKKVAQQSPRAQVAIFVVL 355
V T+ +W+ + S L FV+W ++ +++ L P G G ++ + + + VA+FV L
Sbjct: 100 VYDTSVEWLDKFSIGLLCAFVVWSLNRLLTILEPPQQG--GHQR--RTTSKFHVAVFVAL 155
Query: 356 AMTLRRKPDVLVNVMPKIMGNNKYLGQEKLPIIVWVIAQASQGDLVSGMFCWAHSLFPT- 532
AM LR +P+ LV V+P + ++Y G +KLPI+VW++AQASQGDL G++ W+ +L P
Sbjct: 156 AMVLRNEPNTLVIVLPTLK-EDEYQGHDKLPILVWMMAQASQGDLSVGLYSWSCNLLPVF 214
Query: 533 ----LCAKSSGNPLARDLVLQLLERILSVTKARSILLNGAVRKGERLVPPVSFDLFMRAT 700
L S N + DL+LQL E ILS AR+IL+NG V +RL+ P +F+L MR T
Sbjct: 215 YQENLLPVSRSNSQSMDLILQLAEMILSNLDARTILVNGTVIDKQRLISPYAFELLMRLT 274
Query: 701 FPVSNARVKATERFEAAYPMIKELALAGPPGSKTVKQASQQLLPLCAKAMQENNAELTRE 880
FP S+ RVKATERFEA YP++KE+ALA PGS+ +KQ +QQ+ N L +E
Sbjct: 275 FPASSERVKATERFEAIYPLLKEVALACEPGSELMKQVTQQIFHYSLIIAGRRNLVLAKE 334
Query: 881 AVDVFIWCLTQNAESYKXXERIYLENIEAXXAVLSKVV 994
A + +W LT+N + K E++Y EN EA AVL K+V
Sbjct: 335 ATAIAVWSLTENVDCCKQWEKLYWENKEASVAVLKKLV 372
>gb|AAC00602.1| Unknown protein [Arabidopsis thaliana]
Length = 1299
Score = 245 bits (626), Expect = 2e-63
Identities = 131/264 (49%), Positives = 182/264 (68%), Gaps = 4/264 (1%)
Frame = +2
Query: 149 IPLCHIPEAVIKTASDWISQRSSDALGDFVLWCIDSIMSELSGPSAGPKGSKKVAQQ-SP 325
IPL HIPEAV KT++DWI+QR +ALG FVLW +D I+++L+ G KG KK AQQ S
Sbjct: 170 IPLSHIPEAVYKTSADWINQRPIEALGAFVLWGLDCILADLAVQQGGVKGGKKGAQQASS 229
Query: 326 RAQVAIFVVLAMTLRRKPDVLVNVMPKIMGNNKYLGQEKLPIIVWVIAQASQGDLVSGMF 505
++QVAIFV +AM LR+KPD L N++P + N KY GQ+KLP+ VW++AQASQGD+ G++
Sbjct: 230 KSQVAIFVAVAMVLRKKPDALTNILPTLRENPKYQGQDKLPVTVWMMAQASQGDISVGLY 289
Query: 506 CWAHSLFPTLCAKSSGNPLARDLVLQLLERILSVTKARSILLNGAVRKGERLVPPVSFDL 685
WAH+L P + +KS NP +RDL+LQL+ERILS KAR+IL+NGAVRKGERL+PP SF++
Sbjct: 290 SWAHNLLPVVSSKSC-NPQSRDLILQLVERILSNPKARTILVNGAVRKGERLIPPPSFEI 348
Query: 686 FMRATFPVSNARVKA---TERFEAAYPMIKELALAGPPGSKTVKQASQQLLPLCAKAMQE 856
+R TFP S+ARVK T+ +A+ ++K+L S + A L K++++
Sbjct: 349 LVRLTFPASSARVKENLYTDNLKASVAVLKKLIGEWKERSVKLTPAETLTLNQTMKSLRQ 408
Query: 857 NNAELTREAVDVFIWCLTQNAESY 928
N E E + L ++A+ Y
Sbjct: 409 KNEEALTEGGNGVSQSLYKDADKY 432
>gb|AAU44392.1| hypothetical protein AT1G23170 [Arabidopsis thaliana]
Length = 375
Score = 238 bits (607), Expect = 3e-61
Identities = 117/203 (57%), Positives = 155/203 (76%), Gaps = 1/203 (0%)
Frame = +2
Query: 8 SSSYENQQDIQLMRFADYFGRAFVAVSASQFAWAKMFKESTVSKMVDIPLCHIPEAVIKT 187
S SY +Q +IQLM+FADYFGR+ VS++ F W K FKES +SK++DIPL HIPEAV KT
Sbjct: 169 SESYASQPEIQLMKFADYFGRSLSQVSSAHFPWVKTFKESPLSKLIDIPLSHIPEAVYKT 228
Query: 188 ASDWISQRSSDALGDFVLWCIDSIMSELSGPSAGPKGSKKVAQQ-SPRAQVAIFVVLAMT 364
++DWI+QR +ALG FVLW +D I+++L+ G KG KK AQQ S ++QVAIFV +AM
Sbjct: 229 SADWINQRPIEALGAFVLWGLDCILADLAVQQGGVKGGKKGAQQASSKSQVAIFVAVAMV 288
Query: 365 LRRKPDVLVNVMPKIMGNNKYLGQEKLPIIVWVIAQASQGDLVSGMFCWAHSLFPTLCAK 544
LR+KPD L N++P + N KY GQ+KLP+ VW++AQASQGD+ G++ WAH+L P + +K
Sbjct: 289 LRKKPDALTNILPTLRENPKYQGQDKLPVTVWMMAQASQGDISVGLYSWAHNLLPVVSSK 348
Query: 545 SSGNPLARDLVLQLLERILSVTK 613
S NP +RDL+LQL+ERILS K
Sbjct: 349 SC-NPQSRDLILQLVERILSNPK 370
>gb|AAH90099.1| LOC548392 protein [Xenopus tropicalis]
Length = 442
Score = 49.3 bits (116), Expect = 3e-04
Identities = 45/187 (24%), Positives = 77/187 (41%), Gaps = 3/187 (1%)
Frame = +2
Query: 452 IVWVIAQASQGDLVSGMFCWAHSLFPTLCAKSSGNPLARDLVLQLLERILSVTKARSILL 631
++W + QA DL G+ W +FP L K+ +P A + L+R+L L
Sbjct: 14 VMWAVGQAGFTDLAEGLKVWLGLMFPVLGVKNL-SPYA----ILYLDRLL--------LA 60
Query: 632 NGAVRKGERLVPPVSFDLFMRATFPVSNARVKATERFEAAYPMIKELALAGPPGSKT--- 802
+ + KG + P F + A P ++ E + YP +K LA P S
Sbjct: 61 HSNLTKGFGMGPKDFFPILDFAFMPNNSLTPSQQENLRSLYPRLKVLAFGANPESTLHTY 120
Query: 803 VKQASQQLLPLCAKAMQENNAELTREAVDVFIWCLTQNAESYKXXERIYLENIEAXXAVL 982
+ P C AM++ EL + D CL ++ S+ ++Y +++ +L
Sbjct: 121 FPSFLSRATPSCPAAMRK---ELIQSLSD----CLNKDPLSFSVWRQLYTKHLAQSSLLL 173
Query: 983 SKVVIDW 1003
+V W
Sbjct: 174 QHLVETW 180
>ref|XP_700034.1| PREDICTED: hypothetical protein XP_694942 [Danio rerio]
Length = 430
Score = 43.9 bits (102), Expect = 0.012
Identities = 44/189 (23%), Positives = 79/189 (41%), Gaps = 4/189 (2%)
Frame = +2
Query: 452 IVWVIAQASQGDLVSGMFCWAHSLFPTLCAKSSGNPLARDLVLQLLERILSVTKARSILL 631
I+W + QA DL G+ W + P L K+ + LER+L+ L
Sbjct: 145 IMWALGQAGFYDLSQGIRVWLGIMLPVLGMKALSA-----YAIAYLERLLT--------L 191
Query: 632 NGAVRKGERLVPPVSFDLFMRATFPVSNARVKAT-ERFEAAYPMIKELALAGPPGSKT-- 802
+ + KG ++ P F + + NA ++ E+ YP IK LA P S
Sbjct: 192 HANLTKGFGIMGPKEFFPLLDFAYMPKNALSQSLQEQLCRLYPRIKVLAFGAKPESTLHT 251
Query: 803 -VKQASQQLLPLCAKAMQENNAELTREAVDVFIWCLTQNAESYKXXERIYLENIEAXXAV 979
+ P C AM++ EL R + CL+ +++S ++Y +++ +
Sbjct: 252 YFPSFLSRATPNCPGAMKK---ELLRSLTE----CLSVDSQSLSVWRQLYTKHLPQSSLL 304
Query: 980 LSKVVIDWR 1006
L+ ++ W+
Sbjct: 305 LNHLLKTWK 313
>emb|CAG12945.1| unnamed protein product [Tetraodon nigroviridis]
Length = 637
Score = 43.5 bits (101), Expect = 0.016
Identities = 47/222 (21%), Positives = 86/222 (38%), Gaps = 3/222 (1%)
Frame = +2
Query: 347 VVLAMTLRRKPDVLVNVMPKIMGNNKYLGQE--KLPIIVWVIAQASQGDLVSGMFCWAHS 520
V + L+ KP + +P + + + K I+W + QA DL G+ W
Sbjct: 175 VCIQAILQDKPKIATQNLPMYLELLRSVQNRPVKCLTIMWALGQAGFCDLSQGLRVWLGI 234
Query: 521 LFPTLCAKSSGNPLARDLVLQLLERILSVTKARSILLNGAVRKGERLVPPVSFDLFMRAT 700
+ P L K+ + LER+L LL+ + KG ++ P F +
Sbjct: 235 MLPVLGVKALSA-----YAIAYLERLL--------LLHTNLTKGFGILGPKEFFPLLDFA 281
Query: 701 FPVSNARVKAT-ERFEAAYPMIKELALAGPPGSKTVKQASQQLLPLCAKAMQENNAELTR 877
F NA + E+ YP IK L+ G+K L ++A ++ +
Sbjct: 282 FMPKNALSPSLQEQLRRLYPRIKVLSF----GAKPESTLHTYLPSFLSRATPHCPEDMKK 337
Query: 878 EAVDVFIWCLTQNAESYKXXERIYLENIEAXXAVLSKVVIDW 1003
E + CL + +S ++Y +++ +L ++ W
Sbjct: 338 ELLGSMTECLCVDVQSLGVWRQLYTKHLAQSSLLLKHLLKSW 379
>gb|AAH54691.1| LOC402840 protein [Danio rerio]
Length = 304
Score = 43.1 bits (100), Expect = 0.020
Identities = 43/189 (22%), Positives = 79/189 (41%), Gaps = 4/189 (2%)
Frame = +2
Query: 452 IVWVIAQASQGDLVSGMFCWAHSLFPTLCAKSSGNPLARDLVLQLLERILSVTKARSILL 631
I+W + QA DL G+ W + P L K+ + LER+L+ L
Sbjct: 46 IMWALGQAGFYDLSQGIRVWLGIMLPVLGMKALSA-----YAIAYLERLLT--------L 92
Query: 632 NGAVRKGERLVPPVSFDLFMRATFPVSNARVKAT-ERFEAAYPMIKELALAGPPGSKT-- 802
+ + KG ++ P F + + NA ++ E+ YP +K LA P S
Sbjct: 93 HANLTKGFGIMGPKEFFPLLDFAYMPKNALSQSLQEQLCRLYPRLKVLAFGAKPESTLHT 152
Query: 803 -VKQASQQLLPLCAKAMQENNAELTREAVDVFIWCLTQNAESYKXXERIYLENIEAXXAV 979
+ P C AM++ EL R + CL+ +++S ++Y +++ +
Sbjct: 153 YFPPFLSRATPSCPGAMKK---ELLRSLTE----CLSVDSQSLSVWRQLYTKHLPQSSLL 205
Query: 980 LSKVVIDWR 1006
L+ ++ W+
Sbjct: 206 LNHLLKTWK 214
>ref|XP_796011.1| PREDICTED: similar to CG33129-PE, isoform E, partial
[Strongylocentrotus purpuratus]
Length = 509
Score = 40.0 bits (92), Expect = 0.17
Identities = 56/293 (19%), Positives = 105/293 (35%), Gaps = 4/293 (1%)
Frame = +2
Query: 137 KMVDIPLCHIPEAVIKTASDWISQRSSDALGDFVLWCIDSIMSELSGPSAGPKGSKKVAQ 316
K D PLC + +V K + + L F CI + ++L PKG+
Sbjct: 241 KPSDYPLCKMNSSVRKLLKNLMETYPERMLSMFFQHCIREVETDL------PKGN----- 289
Query: 317 QSPRAQVAIFVVLAMTLRRKPDVLVNVMPKIMGNNKYLGQEKLP--IIVWVIAQASQGDL 490
A + L + P + N + + K ++ I+W QA Q D
Sbjct: 290 ----AVNGYRIFLQLLAVEYPQFVTNRISDYLEVFKRRQSDRATCLTILWGCIQAGQHDP 345
Query: 491 VSGMFCWAHSLFPTLCAKSSGNPLARDLVLQLLERILSVTKARSILLNGAVRKGERLVPP 670
V G+ W+ + P L G+ + + L+R L R + L P
Sbjct: 346 VIGLQVWSKLMLPLL-----GHKMVSPYAISTLDRFLGQKLDEK-------RASQVLGPN 393
Query: 671 VSFDLFMRATFPVSNARVKATERFEAAYPMIKELALAGPPGSKTVKQASQQLLPLCAKAM 850
F + P ++ + ++ YP +K LA P S ++ LL
Sbjct: 394 EFFPILDYVFTPNNSLQPNLQKQLLGHYPRLKRLAFRENPES-NLRNFFPSLLARTTDHW 452
Query: 851 QENNAEL--TREAVDVFIWCLTQNAESYKXXERIYLENIEAXXAVLSKVVIDW 1003
N L ++ ++CL Q+ + ++Y +++ +++ ++ W
Sbjct: 453 PAINPSLPWVLHLLECLVFCLCQDQHCFSEWRQMYDSHMKQSSLLMNHIIKVW 505
>ref|XP_426216.1| PREDICTED: similar to FLJ20254 protein [Gallus gallus]
Length = 592
Score = 40.0 bits (92), Expect = 0.17
Identities = 59/278 (21%), Positives = 106/278 (38%), Gaps = 6/278 (2%)
Frame = +2
Query: 146 DIPLCHIPEAVIKTASDWISQRSSDALGDFVLWCIDSIMSELSGPSAGPKGSKKVAQQSP 325
D P C + + +K A + +SS L F CI +++ EL ++ Q+
Sbjct: 53 DYPYCLVSKE-LKNAIRSLLGKSSGVLELFFDHCIYTMLQELDKTPGESLHGYRICIQA- 110
Query: 326 RAQVAIFVVLAMTLRRKPDVLVNVMPKIMG--NNKYLGQEKLPIIVWVIAQASQGDLVSG 499
L +P + + K + + K I+W + QA DL G
Sbjct: 111 ------------VLLERPKIATTNLGKYLELLRSHQNRPAKCLTILWALGQAGFTDLAEG 158
Query: 500 MFCWAHSLFPTLCAKSSGNPLARDLVLQLLERILSVTKARSILLNGAVRKGERLVPPVSF 679
+ W + P L K+ +P A + L+R+L+V + + KG ++ P F
Sbjct: 159 LRVWLGVMLPVLGIKAL-SPYA----VSYLDRLLTV--------HPNLTKGFGMIGPKDF 205
Query: 680 -DLFMRATFPVSNARVKATERFEAAYPMIKELALAGPPGSKT---VKQASQQLLPLCAKA 847
L A P ++ E+ YP +K LAL P + + P C A
Sbjct: 206 FPLLDFAFMPNNSLSPSLQEQLRRLYPRLKVLALGARPETTLHTYFPSFLSRATPSCPPA 265
Query: 848 MQENNAELTREAVDVFIWCLTQNAESYKXXERIYLENI 961
M+ +E + CL+ + S+ ++Y +++
Sbjct: 266 MR-------KELLTSMSQCLSVDPLSFSVWRQLYTKHL 296
>ref|XP_514113.1| PREDICTED: similar to ubiquitin specific proteinase 43 [Pan
troglodytes]
Length = 428
Score = 37.7 bits (86), Expect = 0.85
Identities = 20/51 (39%), Positives = 27/51 (52%)
Frame = +2
Query: 173 AVIKTASDWISQRSSDALGDFVLWCIDSIMSELSGPSAGPKGSKKVAQQSP 325
AV K S + DAL +F+LW +D + +L G S GP K V+ Q P
Sbjct: 153 AVSKYGSQFQGNSQHDAL-EFLLWLLDRVHEDLEGSSRGPVSEKTVSAQMP 202
>ref|XP_474995.1| OSJNBa0065B15.15 [Oryza sativa (japonica cultivar-group)]
emb|CAE05813.1| OSJNBa0028M15.5 [Oryza sativa (japonica cultivar-group)]
emb|CAD39911.2| OSJNBa0065B15.15 [Oryza sativa (japonica cultivar-group)]
Length = 1055
Score = 36.6 bits (83), Expect = 1.9
Identities = 33/134 (24%), Positives = 58/134 (43%), Gaps = 9/134 (6%)
Frame = +2
Query: 518 SLFPTLCAKSSGNPLARDLVLQLLERILSVTKARSILLNGAVRKGERLVPPVSFDLFMRA 697
+ F LC+K+ L +D+++Q+ E I+ + + K E++ PP FD+ M
Sbjct: 629 NFFKRLCSKT----LNKDVLVQMNEEIIVL-----------LCKLEKIFPPALFDVMMHL 673
Query: 698 TFP-VSNARVKATERFEAAYPMIKELAL--------AGPPGSKTVKQASQQLLPLCAKAM 850
V A ++ ++ YP+ + L A P GS + + L C+K M
Sbjct: 674 PVHLVEEALLRGPVQYGWMYPIERRLYTLKRYVRNGARPEGSIAEAYIADECLTFCSKYM 733
Query: 851 QENNAELTREAVDV 892
+ REA +V
Sbjct: 734 DDVETRFNREARNV 747
>gb|EAA14245.3| ENSANGP00000015679 [Anopheles gambiae str. PEST]
ref|XP_318929.2| ENSANGP00000015679 [Anopheles gambiae str. PEST]
Length = 579
Score = 35.8 bits (81), Expect = 3.2
Identities = 45/217 (20%), Positives = 82/217 (37%), Gaps = 4/217 (1%)
Frame = +2
Query: 347 VVLAMTLRRKPDVLVNVMPK-IMGNNKYLGQEKLPI-IVWVIAQASQGDLVSGMFCWAHS 520
V+L P VN + + + N Y + + + ++W + Q DL G+ W
Sbjct: 122 VILQAIAMHYPSACVNNLARNAILRNSYQNRHNIGLSLLWALGQGGYNDLDVGLKVWQDI 181
Query: 521 LFPTLCAKSSGNPLARDLVLQLLERILSVTKARSILLNGAVRKGERLVPPVSFDLFMRAT 700
+ P + K+ N D V+ RIL + +A + L G+ F+
Sbjct: 182 MVPVMELKNY-NRFTSDYVV----RILRLHRAHRLTLGGSE--------------FLTIL 222
Query: 701 FPVSNARVKATERFEAAYPMIKELALAGPPGSKTVKQASQQLLPLCAKAMQENNAELTRE 880
++ E EAA +++ + P S T + +N + +TR
Sbjct: 223 SSLTTQPKACRELDEAAQLLVERYVFSAPKASATFTM------------LFKNVSFITRP 270
Query: 881 AVDVF--IWCLTQNAESYKXXERIYLENIEAXXAVLS 985
+ + CL ++ ES +Y N+E A+LS
Sbjct: 271 EMIYYGLALCLLEDPESAAVWLGLYRSNVETSLAILS 307
>ref|XP_750113.1| GPI anchored protein [Aspergillus fumigatus Af293]
gb|EAL88075.1| GPI anchored protein, putative [Aspergillus fumigatus Af293]
Length = 250
Score = 34.7 bits (78), Expect = 7.2
Identities = 26/113 (23%), Positives = 37/113 (32%), Gaps = 7/113 (6%)
Frame = +2
Query: 8 SSSYENQQDIQLMRFADYFGRAFVAVSASQFAWAKMFKESTVSKMVDIPLCHIPEAVIKT 187
+ SY L ++G V Q+ W+ F S K D P + T
Sbjct: 84 TGSYSWTPSTDLENDVTHYGLLLVVEGTGQYQWSTQFGISNPGKAADTPAASVTATTSAT 143
Query: 188 ASDWISQRSSDALGDFVL-------WCIDSIMSELSGPSAGPKGSKKVAQQSP 325
+ SS A + L WC +S S P P G+ + SP
Sbjct: 144 SEVETPAASSPADSNVTLVTTETTTWCPESTAKPTSIPVIVPTGAPSIPSGSP 196
Database: nr
Posted date: Apr 6, 2006 2:41 PM
Number of letters in database: 1,185,965,366
Number of sequences in database: 3,454,138
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,214,894,810
Number of Sequences: 3454138
Number of extensions: 48016200
Number of successful extensions: 125876
Number of sequences better than 10.0: 17
Number of HSP's better than 10.0 without gapping: 120200
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 125836
length of database: 1,185,965,366
effective HSP length: 132
effective length of database: 730,019,150
effective search space used: 175204596000
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)