Details for: kst

Gene ID: 38418

Symbol: kst

Ensembl ID: FBgn0004167

Description: karst

Associated with

Other Information

Genular Protein ID: 4176023069

Symbol: M9PBL6_DROME

Name: N/A

UniProtKB Accession Codes:

Database IDs:

Citations:

PubMed ID: 10731132

Title: The genome sequence of Drosophila melanogaster.

PubMed ID: 10731132

DOI: 10.1126/science.287.5461.2185

PubMed ID: 12537568

Title: Finishing a whole-genome shotgun: release 3 of the Drosophila melanogaster euchromatic genome sequence.

PubMed ID: 12537568

PubMed ID: 12537572

Title: Annotation of the Drosophila melanogaster euchromatic genome: a systematic review.

PubMed ID: 12537572

DOI: 10.1186/gb-2002-3-12-research0083

PubMed ID: 12537573

Title: The transposable elements of the Drosophila melanogaster euchromatin: a genomics perspective.

PubMed ID: 12537573

PubMed ID: 12537574

Title: Heterochromatic sequences in a Drosophila whole-genome shotgun assembly.

PubMed ID: 12537574

PubMed ID: 16110336

Title: Combined evidence annotation of transposable elements in genome sequences.

PubMed ID: 16110336

DOI: 10.1371/journal.pcbi.0010022

PubMed ID: 17569856

Title: The Release 5.1 annotation of Drosophila melanogaster heterochromatin.

PubMed ID: 17569856

DOI: 10.1126/science.1139815

PubMed ID: 17569867

Title: Sequence finishing and mapping of Drosophila melanogaster heterochromatin.

PubMed ID: 17569867

DOI: 10.1126/science.1139816

Sequence Information:

  • Length: 4321
  • Mass: 495964
  • Checksum: 76CF3205B74355AB
  • Sequence:
  • MEYNSVLRSN FSRNEYRRYI SYERQSLASQ YEPGGYSALQ TPTPSNRNSA NMTQRDGIIK 
    FENERIKTLQ EERLHIQKKT FTKWMNSFLI KAKMEVEDLF TDLADGIKLL KLLEIISSEK 
    LGKPNSGRMR VHKIENVNKS LAFLHTKVRL ESIGAEDIVD GNPRLILGLI WTIILRFQIQ 
    EIEIDVDEEN ESSEKRSAKD ALLLWCQRKT HGYPGVNITD FTNSWRSGLG FNALIHSHRP 
    DLFEYSTIVN SKNSNLDNLN HAFDTAANEL GIPSLLDAED IDSARPDEKS ILTYVASYYH 
    TFARMKNEQK SGKRIANIVG QLMDADRKKM QYEGLTTNLL SWIRQKTLEL EQRDLPNSLE 
    GIQRELLAFK EYRTIEKPPK YKERSEIEAL YFTINTLLKA LNQPPYNPQD GQLVNDIEKA 
    WQILEYAEHH REVALRDELL RQEKLEQLNY KFEKKSVLRE GYLKEMIQVL SDPRYLRQVD 
    ATLKKHEAIS ADILARVERF NDLTAMAEEL DRENYHGKER VRRREQEVMA KWRQLLELLE 
    NQRLNLSQMS NLMNLLREIA STTEAVRELQ QQFASEDVGP HLLGVEELLQ AHSLQELQVN 
    TYGETLKRFN RQALPYKSSE HKDAALLAQR LADLEEAYSE LLRRSAARRA RLEEARNFHH 
    FMEDYDNEES WLVDKQRICK TGITAKDLRA VLSLQQKHKA LEDEIKSRKP KSGQMSTAGK 
    RLIGEQHPRS SEIQSRIDSL AEHWQALEAL VELRRRQLED AAEAYQFYTD ANEAESWLNE 
    KIALVNSRDY GNDEPSAQAL LQRHRDLQGE LNAYSGDILN LNQQADKLIK AGICTLELSA 
    AEPELPEVEQ EEWVNETRLV PKEVWEDEWV EKLEHKKVTE TKMLPHVKSL FPFEGQGMKM 
    DKGEVMLLKS KTNDDWWCVR KDNGVEGFVP ANYVREVEPR PVACIVPKAE KVKSLQKVKK 
    TILVRQVVPV KRIKPVSVAP KPLVQRRTST QSINENADSV EKRQQRINQT YDELQEMAQK 
    RHALLEDSIH LFGFYRECDD FEKWMKEKER MIKSDEGEGV DNAKRKFEKF ITDLSAASKR 
    VEEIDGAVDT FRRQGHSQLD KIIARQRQIH QIWQRLNNAK AQREKSLEGA SSVELFNRTC 
    DEAKVWMSEK MLQLDTAVIT PDLRTVQALQ RRHQNLEREL APVEDKVNRV TYLGNSVKNA 
    YPAEKDNVNA RQQEVQDMWQ QVQQRGSDLR NRIESEVGQQ VFNNSAKVLL AWIDSVKDQL 
    NADESARDVE TANNLLKKHN DLGDDIRAHD TEFVEVIQLG KQLSDGKPNM AETVAVIERL 
    KAEQDAIHRG WAEKQKWLLQ CVDLQMFNRE ADKIDATTKS HEAFLEYNNL GASLDEVEAI 
    LKRHLDFEKS LMAQDKILKG FSDNADKLIS NDHYDSKYIG DRRNQVLGKR KAVKDRAFER 
    KRLLQASKDF HKFAAEADDL KVWLQDKTRI AGDENYRDLS NLPRKLQKHQ AFERELRANE 
    GQLRNVTKDG QALVQAGNRV PEVESRVADL NKRWKDLLTL SEDKGRKLEQ AASQREHNRS 
    LEDAKKKVDE LDSALRSGDV GNDLRSCKDL INKQQILESE ITIWDQKVAE LVSTGDDMAH 
    GGHFNAQNIE AGTKELQQRF KDLRDPTQRR RAKLEESLNY HKFVFELDSE FQWINEHLPA 
    AKSNELGQNL HQAQSLHKKH KKLEAEIKGH QPMINKALVA GQSLISQQHP EREQVESLCQ 
    QLEQAWQDLE RHCGERSRKL DMSLKAQQYL FDAGEIESWL GERNNVLRST EYGRDRDSAA 
    KLLTKHKTIE LELDTYSGIV TEMGHSCAAM VAANHPDSKV LAAKQQLIEK MLKSLHKLAS 
    QRQGRLMESL YKHEYFLESD EVEQWIREQE QAASSEDYGQ DFEHLQLLQN KFDDLKHRVE 
    VGADRVDQCE LLAKKLIDSE SPYANEVEKR QEQLRTSWEN LLQLLNQREQ KLHAAGEIHR 
    FHRDVAEALF RIQDKNAALS QELGRDLNSA LALLRKHEGF ENDLVALEAQ LQVLVEDSVR 
    LQAKYPSNAS AIAQQQDKVV AAWNDLKERS TARGDRLAAS SDLQTFLTDV RDIVSWSSNL 
    RAALQAEEHV SDAAGATALK IQHDAIYGEI EAREDKFRYL NELSDSMVQT GHYAAADVEE 
    KCAAMLDERQ KLHAAWNKKK IMLEQKIDLF CFLRDAKQID NLSSSQQAAL SSSDFGQTVE 
    DVQNKIRKHD EFERLIQTQE EKVSLLQEHG RKLIEQRHYD SANIQTILQG VLARRQKVKD 
    LCAVRRYKLE DALLYAKFVR DCAEAKYWIN EKQKKLEADA ASYAEVTNLD EKIKKLQKHQ 
    AFQAEVAANQ GRIQEIQDTG VILLSKQHES SPEIKRAIEI VLEAWQGLLA ELEQRGRGLE 
    EAQDSLEFNS QLDKIEAWIR DKEMMVQASD TGRDLEHCNA LMRKLDDVDS DMRVDDQRVK 
    HINQLADKLI NQAQVPADTQ SVDKRRKDFN YNWRQLQGAL NAYRALLGGA NEIHVFNRDV 
    DDTADRIAEK SLAMSSTDTG RDLAAVEALI RREEALERDM SAVKQKIDQH ETAAEFLIKK 
    YPERGAQHIE RKLEELHKSW GNLQALSVKR QSILNEAYLA HKFVSDVKEL ELWVNDMIKK 
    MNNTQSPSTI NDCETQLELH QERKVEIEGR QEAFAGLKQQ GEQLSKRPQQ QQPDNVRKYL 
    LVLEELHQTL NEAWSERARD LTEAHQLQLF KAQVEQVEIW LANKEAFLNN DDLGDSYTAV 
    ERLLKKHDEF EKLLHADHVD TLQKFANSIL EGEPKDADLI REKLAYILRR KQKLLELSEE 
    RKQRLTQSHQ LQEFLRSLYE IDRWLVQKLQ VALDENYREP SNLQSKIQKH AAFDAELLSN 
    SPRVQSVIHE GERLIRGDHF AKDEIAQQVQ LLEGDWLKLK GASQTKKDKL QQAYDALAFN 
    RSVDEFNNWM DEVELQLSSE DYGKDLAAVS NLLKKHERLE ADVAHHGELA DQLKQKDEQF 
    FQAEHFLRHE IHERATVSIR RYNTLHEPLG IRRENLEDSL SLQQFLRDAE DELQWLAEKQ 
    LVAGSQDLGT SLLSVQGLQK KHNSLEAELT SQEPLIQALL QRGQQMIRDN HFASEQLQYK 
    SELLQKQLVQ LRDLAAIRRL RLLDAVESQL FYVEANEADA WMREKRPVLS SSDYGRDEVS 
    VQGHQKKLEV LQRELTAFKP SIEKVAKLAT GLIERNHFDS SNIAEKNAQV GQEYEDLLRL 
    AKERESRLGE CKKLFEYLRE TEELHEWVGD QMAVTASEDY GEDVEHVEQL ILAFESFVSN 
    LNANEARVEA CLERGDRLIQ ENNPYRSSIK SKRDETKQLW EELKDLVHAR QDALAGAKQV 
    HVYDRVADET IQLINEKDAS LISEDYGQDL ESIQALGRKH QVFESELVGI QGQVDSVLAE 
    AAKLGEIYPD AKEHIEVKRD ETVEAWTDLK EKTAARKNKL SQAEQLQSYF DEYRDLIAWI 
    NEMLAKITAP ELANSVAGAE LLLASTKDHD TEIRTRDETF AKFAANGQQL IKEKHFLAHE 
    VEDKIKVLQA RHELLKHTLN KRREIYELNL DTQLFLKDAE ILEQWISSRE PQLKDTKLGD 
    SIPQVEDLLR RHEDFEKTVA AQEEKFQAIK RITMLEQLFR HQLEQEKISK LQEKERLEKE 
    RLEQLKQREL QRLADERRRA EKQHEHRQNA ASQEKTPIFS SPMVTPAQTS GPQSPALSQV 
    QLRPPFGDDN EHLALQKSSS SGMFGDRLRR GSADANVKRA ESMKVQPKQA KRTPSFTTRR 
    RAQSFRKNQK GEGFDLPPVE IQGSLERKHG LQSGGKKAPV RSWKQFHTVL CGQLVCFFKD 
    ENDFLQQKTA TAPVNILGAK CERADDYTKK KYVFRLKLPD GSEFLFEAPS LDILNDWVRK 
    ISFHASLPPN MQLLSYDESM KQQSSSSPDI KVTSSVESPV SSRNSSPDSQ RRTSGAQVLD 
    GTATPQMAFL QRQMQQQQQQ QQSQPSSPTG GFDQKPPIPP RGAAPVASHR QSQENLVVMR 
    NRQSSNDLQQ SATLPAGLTG VQQNGNGKDD NALLTRNSEA RQSDNPPPLP TTMPPVGGQH 
    QHPQNSHSHQ NQHQAQVQQR INAFNAAASQ QHQPDYFNNN TARQQPQRIP SGRIDSTRKF 
    IEMEAHNNNG GTSSSPKRST INYSSSGASS NGNGNVKIGS GNSSTTTITT STTTHQVTSS 
    SRTVWHLTSS PTSSTKSSST GGSGEPSHAI SNPSYMGWGN TRFESNRPVS LQPDSISFSR 
    VSAESSSESE AQSISSVSGV KGSKGTKEER RSGMFRIFGR KGDKEKEKDK DKRRSSQVPP 
    Q

Genular Protein ID: 1487740887

Symbol: Q9VZQ3_DROME

Name: N/A

UniProtKB Accession Codes:

Database IDs:

Citations:

PubMed ID: 10731132

Title: The genome sequence of Drosophila melanogaster.

PubMed ID: 10731132

DOI: 10.1126/science.287.5461.2185

PubMed ID: 12537568

Title: Finishing a whole-genome shotgun: release 3 of the Drosophila melanogaster euchromatic genome sequence.

PubMed ID: 12537568

PubMed ID: 12537572

Title: Annotation of the Drosophila melanogaster euchromatic genome: a systematic review.

PubMed ID: 12537572

DOI: 10.1186/gb-2002-3-12-research0083

PubMed ID: 12537573

Title: The transposable elements of the Drosophila melanogaster euchromatin: a genomics perspective.

PubMed ID: 12537573

PubMed ID: 12537574

Title: Heterochromatic sequences in a Drosophila whole-genome shotgun assembly.

PubMed ID: 12537574

PubMed ID: 16110336

Title: Combined evidence annotation of transposable elements in genome sequences.

PubMed ID: 16110336

DOI: 10.1371/journal.pcbi.0010022

PubMed ID: 17569856

Title: The Release 5.1 annotation of Drosophila melanogaster heterochromatin.

PubMed ID: 17569856

DOI: 10.1126/science.1139815

PubMed ID: 17569867

Title: Sequence finishing and mapping of Drosophila melanogaster heterochromatin.

PubMed ID: 17569867

DOI: 10.1126/science.1139816

PubMed ID: 26109357

Title: Gene Model Annotations for Drosophila melanogaster: Impact of High-Throughput Data.

PubMed ID: 26109357

DOI: .1534/g3.115.018929

PubMed ID: 26109356

Title: Gene Model Annotations for Drosophila melanogaster: The Rule-Benders.

PubMed ID: 26109356

DOI: .1534/g3.115.018937

PubMed ID: 25589440

Title: The Release 6 reference sequence of the Drosophila melanogaster genome.

PubMed ID: 25589440

Sequence Information:

  • Length: 4097
  • Mass: 471640
  • Checksum: EDE01C3857FCAF36
  • Sequence:
  • MTQRDGIIKF ENERIKTLQE ERLHIQKKTF TKWMNSFLIK AKMEVEDLFT DLADGIKLLK 
    LLEIISSEKL GKPNSGRMRV HKIENVNKSL AFLHTKVRLE SIGAEDIVDG NPRLILGLIW 
    TIILRFQIQE IEIDVDEENE SSEKRSAKDA LLLWCQRKTH GYPGVNITDF TNSWRSGLGF 
    NALIHSHRPD LFEYSTIVNS KNSNLDNLNH AFDTAANELG IPSLLDAEDI DSARPDEKSI 
    LTYVASYYHT FARMKNEQKS GKRIANIVGQ LMDADRKKMQ YEGLTTNLLS WIRQKTLELE 
    QRDLPNSLEG IQRELLAFKE YRTIEKPPKY KERSEIEALY FTINTLLKAL NQPPYNPQDG 
    QLVNDIEKAW QILEYAEHHR EVALRDELLR QEKLEQLNYK FEKKSVLREG YLKEMIQVLS 
    DPRYLRQVDA TLKKHEAISA DILARVERFN DLTAMAEELD RENYHGKERV RRREQEVMAK 
    WRQLLELLEN QRLNLSQMSN LMNLLREIAS TTEAVRELQQ QFASEDVGPH LLGVEELLQA 
    HSLQELQVNT YGETLKRFNR QALPYKSSEH KDAALLAQRL ADLEEAYSEL LRRSAARRAR 
    LEEARNFHHF MEDYDNEESW LVDKQRICKT GITAKDLRAV LSLQQKHKAL EDEIKSRKPK 
    SGQMSTAGKR LIGEQHPRSS EIQSRIDSLA EHWQALEALV ELRRRQLEDA AEAYQFYTDA 
    NEAESWLNEK IALVNSRDYG NDEPSAQALL QRHRDLQGEL NAYSGDILNL NQQADKLIKA 
    GICTLELSAA EPELPEVEQE EWVNETRLVP KEVWEDEWVE KLEHKKVTET KMLPHVKSLF 
    PFEGQGMKMD KGEVMLLKSK TNDDWWCVRK DNGVEGFVPA NYVREVEPRP VACIVPKAEK 
    VKSLQKVKKT ILVRQVVPVK RIKPVSVAPK PLVQRRTSTQ SINENADSVE KRQQRINQTY 
    DELQEMAQKR HALLEDSIHL FGFYRECDDF EKWMKEKERM IKSDEGEGVD NAKRKFEKFI 
    TDLSAASKRV EEIDGAVDTF RRQGHSQLDK IIARQRQIHQ IWQRLNNAKA QREKSLEGAS 
    SVELFNRTCD EAKVWMSEKM LQLDTAVITP DLRTVQALQR RHQNLERELA PVEDKVNRVT 
    YLGNSVKNAY PAEKDNVNAR QQEVQDMWQQ VQQRGSDLRN RIESEVGQQV FNNSAKVLLA 
    WIDSVKDQLN ADESARDVET ANNLLKKHND LGDDIRAHDT EFVEVIQLGK QLSDGKPNMA 
    ETVAVIERLK AEQDAIHRGW AEKQKWLLQC VDLQMFNREA DKIDATTKSH EAFLEYNNLG 
    ASLDEVEAIL KRHLDFEKSL MAQDKILKGF SDNADKLISN DHYDSKYIGD RRNQVLGKRK 
    AVKDRAFERK RLLQASKDFH KFAAEADDLK VWLQDKTRIA GDENYRDLSN LPRKLQKHQA 
    FERELRANEG QLRNVTKDGQ ALVQAGNRVP EVESRVADLN KRWKDLLTLS EDKGRKLEQA 
    ASQREHNRSL EDAKKKVDEL DSALRSGDVG NDLRSCKDLI NKQQILESEI TIWDQKVAEL 
    VSTGDDMAHG GHFNAQNIEA GTKELQQRFK DLRDPTQRRR AKLEESLNYH KFVFELDSEF 
    QWINEHLPAA KSNELGQNLH QAQSLHKKHK KLEAEIKGHQ PMINKALVAG QSLISQQHPE 
    REQVESLCQQ LEQAWQDLER HCGERSRKLD MSLKAQQYLF DAGEIESWLG ERNNVLRSTE 
    YGRDRDSAAK LLTKHKTIEL ELDTYSGIVT EMGHSCAAMV AANHPDSKVL AAKQQLIEKM 
    LKSLHKLASQ RQGRLMESLY KHEYFLESDE VEQWIREQEQ AASSEDYGQD FEHLQLLQNK 
    FDDLKHRVEV GADRVDQCEL LAKKLIDSES PYANEVEKRQ EQLRTSWENL LQLLNQREQK 
    LHAAGEIHRF HRDVAEALFR IQDKNAALSQ ELGRDLNSAL ALLRKHEGFE NDLVALEAQL 
    QVLVEDSVRL QAKYPSNASA IAQQQDKVVA AWNDLKERST ARGDRLAASS DLQTFLTDVR 
    DIVSWSSNLR AALQAEEHVS DAAGATALKI QHDAIYGEIE AREDKFRYLN ELSDSMVQTG 
    HYAAADVEEK CAAMLDERQK LHAAWNKKKI MLEQKIDLFC FLRDAKQIDN LSSSQQAALS 
    SSDFGQTVED VQNKIRKHDE FERLIQTQEE KVSLLQEHGR KLIEQRHYDS ANIQTILQGV 
    LARRQKVKDL CAVRRYKLED ALLYAKFVRD CAEAKYWINE KQKKLEADAA SYAEVTNLDE 
    KIKKLQKHQA FQAEVAANQG RIQEIQDTGV ILLSKQHESS PEIKRAIEIV LEAWQGLLAE 
    LEQRGRGLEE AQDSLEFNSQ LDKIEAWIRD KEMMVQASDT GRDLEHCNAL MRKLDDVDSD 
    MRVDDQRVKH INQLADKLIN QAQVPADTQS VDKRRKDFNY NWRQLQGALN AYRALLGGAN 
    EIHVFNRDVD DTADRIAEKS LAMSSTDTGR DLAAVEALIR REEALERDMS AVKQKIDQHE 
    TAAEFLIKKY PERGAQHIER KLEELHKSWG NLQALSVKRQ SILNEAYLAH KFVSDVKELE 
    LWVNDMIKKM NNTQSPSTIN DCETQLELHQ ERKVEIEGRQ EAFAGLKQQG EQLSKRPQQQ 
    QPDNVRKYLL VLEELHQTLN EAWSERARDL TEAHQLQLFK AQVEQVEIWL ANKEAFLNND 
    DLGDSYTAVE RLLKKHDEFE KLLHADHVDT LQKFANSILE GEPKDADLIR EKLAYILRRK 
    QKLLELSEER KQRLTQSHQL QEFLRSLYEI DRWLVQKLQV ALDENYREPS NLQSKIQKHA 
    AFDAELLSNS PRVQSVIHEG ERLIRGDHFA KDEIAQQVQL LEGDWLKLKG ASQTKKDKLQ 
    QAYDALAFNR SVDEFNNWMD EVELQLSSED YGKDLAAVSN LLKKHERLEA DVAHHGELAD 
    QLKQKDEQFF QAEHFLRHEI HERATVSIRR YNTLHEPLGI RRENLEDSLS LQQFLRDAED 
    ELQWLAEKQL VAGSQDLGTS LLSVQGLQKK HNSLEAELTS QEPLIQALLQ RGQQMIRDNH 
    FASEQLQYKS ELLQKQLVQL RDLAAIRRLR LLDAVESQLF YVEANEADAW MREKRPVLSS 
    SDYGRDEVSV QGHQKKLEVL QRELTAFKPS IEKVAKLATG LIERNHFDSS NIAEKNAQVG 
    QEYEDLLRLA KERESRLGEC KKLFEYLRET EELHEWVGDQ MAVTASEDYG EDVEHVEQLI 
    LAFESFVSNL NANEARVEAC LERGDRLIQE NNPYRSSIKS KRDETKQLWE ELKDLVHARQ 
    DALAGAKQVH VYDRVADETI QLINEKDASL ISEDYGQDLE SIQALGRKHQ VFESELVGIQ 
    GQVDSVLAEA AKLGEIYPDA KEHIEVKRDE TVEAWTDLKE KTAARKNKLS QAEQLQSYFD 
    EYRDLIAWIN EMLAKITAPE LANSVAGAEL LLASTKDHDT EIRTRDETFA KFAANGQQLI 
    KEKHFLAHEV EDKIKVLQAR HELLKHTLNK RREIYELNLD TQLFLKDAEI LEQWISSREP 
    QLKDTKLGDS IPQVEDLLRR HEDFEKTVAA QEEKFQAIKR ITMLEQLFRH QLEQEKISKL 
    QEKERLEKER LEQLKQRELQ RLADERRRAE KQHEHRQNAA SQEKTPIFSS PMVTPAQTSG 
    PQSPALSQVQ LRPPFGDDNE HLALQKSSSS GMFGDRLRRG SADANVKRAE SMKVQPKQAK 
    RTPSFTTRRR AQSFRKNQKG EGFDLPPVEI QGSLERKHGL QSGGKKAPVR SWKQFHTVLC 
    GQLVCFFKDE NDFLQQKTAT APVNILGAKC ERADDYTKKK YVFRLKLPDG SEFLFEAPSL 
    DILNDWVRKI SFHASLPPNM QLLSYDESMK QQSSSSPDIK VTSSVESPVS SRNSSPDSQR 
    RTSGAQVLDG TATPQMAFLQ RQMQQQQQQQ QSQPSSPTGG FDQKPPIPPR GAAPVASHRQ 
    SQENLVVMRN RQSSNDLQQS ATLPAGLTGV QQNGNGKDDN ALLTRNSEAR QSGWGNTRFE 
    SNRPVSLQPD SISFSRVSAE SSSESEAQSI SSVSGVKGSK GTKEERRSGM FRIFGRKGDK 
    EKEKDKDKRR SSQVPPQ

Genular Protein ID: 2080964759

Symbol: A8JNJ6_DROME

Name: N/A

UniProtKB Accession Codes:

Database IDs:

Citations:

PubMed ID: 10731132

Title: The genome sequence of Drosophila melanogaster.

PubMed ID: 10731132

DOI: 10.1126/science.287.5461.2185

PubMed ID: 12537568

Title: Finishing a whole-genome shotgun: release 3 of the Drosophila melanogaster euchromatic genome sequence.

PubMed ID: 12537568

PubMed ID: 12537572

Title: Annotation of the Drosophila melanogaster euchromatic genome: a systematic review.

PubMed ID: 12537572

DOI: 10.1186/gb-2002-3-12-research0083

PubMed ID: 12537573

Title: The transposable elements of the Drosophila melanogaster euchromatin: a genomics perspective.

PubMed ID: 12537573

PubMed ID: 12537574

Title: Heterochromatic sequences in a Drosophila whole-genome shotgun assembly.

PubMed ID: 12537574

PubMed ID: 16110336

Title: Combined evidence annotation of transposable elements in genome sequences.

PubMed ID: 16110336

DOI: 10.1371/journal.pcbi.0010022

PubMed ID: 17569856

Title: The Release 5.1 annotation of Drosophila melanogaster heterochromatin.

PubMed ID: 17569856

DOI: 10.1126/science.1139815

PubMed ID: 17569867

Title: Sequence finishing and mapping of Drosophila melanogaster heterochromatin.

PubMed ID: 17569867

DOI: 10.1126/science.1139816

Sequence Information:

  • Length: 4337
  • Mass: 497554
  • Checksum: A1CFE66B0425D34D
  • Sequence:
  • MEYNSVLRSN FSRNEYRRYI SYERQSLASQ YEPGGYSALQ TPTPSNRNSA NMTQRDGIIK 
    FENERIKTLQ EERLHIQKKT FTKWMNSFLI KAKMEVEDLF TDLADGIKLL KLLEIISSEK 
    LGKPNSGRMR VHKIENVNKS LAFLHTKVRL ESIGAEDIVD GNPRLILGLI WTIILRFQIQ 
    EIEIDVDEEN ESSEKRSAKD ALLLWCQRKT HGYPGVNITD FTNSWRSGLG FNALIHSHRP 
    DLFEYSTIVN SKNSNLDNLN HAFDTAANEL GIPSLLDAED IDSARPDEKS ILTYVASYYH 
    TFARMKNEQK SGKRIANIVG QLMDADRKKM QYEGLTTNLL SWIRQKTLEL EQRDLPNSLE 
    GIQRELLAFK EYRTIEKPPK YKERSEIEAL YFTINTLLKA LNQPPYNPQD GQLVNDIEKA 
    WQILEYAEHH REVALRDELL RQEKLEQLNY KFEKKSVLRE GYLKEMIQVL SDPRYLRQVD 
    ATLKKHEAIS ADILARVERF NDLTAMAEEL DRENYHGKER VRRREQEVMA KWRQLLELLE 
    NQRLNLSQMS NLMNLLREIA STTEAVRELQ QQFASEDVGP HLLGVEELLQ AHSLQELQVN 
    TYGETLKRFN RQALPYKSSE HKDAALLAQR LADLEEAYSE LLRRSAARRA RLEEARNFHH 
    FMEDYDNEES WLVDKQRICK TGITAKDLRA VLSLQQKHKA LEDEIKSRKP KSGQMSTAGK 
    RLIGEQHPRS SEIQSRIDSL AEHWQALEAL VELRRRQLED AAEAYQFYTD ANEAESWLNE 
    KIALVNSRDY GNDEPSAQAL LQRHRDLQGE LNAYSGDILN LNQQADKLIK AGICTLELSA 
    AEPELPEVEQ EEWVNETRLV PKEVWEDEWV EKLEHKKVTE TKMLPHVKSL FPFEGQGMKM 
    DKGEVMLLKS KTNDDWWCVR KDNGVEGFVP ANYVREVEPR PVACIVPKAE KVKSLQKVKK 
    TILVRQVVPV KRIKPVSVAP KPLVQRRTST QSINENADSV EKRQQRINQT YDELQEMAQK 
    RHALLEDSIH LFGFYRECDD FEKWMKEKER MIKSDEGEGV DNAKRKFEKF ITDLSAASKR 
    VEEIDGAVDT FRRQGHSQLD KIIARQRQIH QIWQRLNNAK AQREKSLEGA SSVELFNRTC 
    DEAKVWMSEK MLQLDTAVIT PDLRTVQALQ RRHQNLEREL APVEDKVNRV TYLGNSVKNA 
    YPAEKDNVNA RQQEVQDMWQ QVQQRGSDLR NRIESEVGQQ VFNNSAKVLL AWIDSVKDQL 
    NADESARDVE TANNLLKKHN DLGDDIRAHD TEFVEVIQLG KQLSDGKPNM AETVAVIERL 
    KAEQDAIHRG WAEKQKWLLQ CVDLQMFNRE ADKIDATTKS HEAFLEYNNL GASLDEVEAI 
    LKRHLDFEKS LMAQDKILKG FSDNADKLIS NDHYDSKYIG DRRNQVLGKR KAVKDRAFER 
    KRLLQASKDF HKFAAEADDL KVWLQDKTRI AGDENYRDLS NLPRKLQKHQ AFERELRANE 
    GQLRNVTKDG QALVQAGNRV PEVESRVADL NKRWKDLLTL SEDKGRKLEQ AASQREHNRS 
    LEDAKKKVDE LDSALRSGDV GNDLRSCKDL INKQQILESE ITIWDQKVAE LVSTGDDMAH 
    GGHFNAQNIE AGTKELQQRF KDLRDPTQRR RAKLEESLNY HKFVFELDSE FQWINEHLPA 
    AKSNELGQNL HQAQSLHKKH KKLEAEIKGH QPMINKALVA GQSLISQQHP EREQVESLCQ 
    QLEQAWQDLE RHCGERSRKL DMSLKAQQYL FDAGEIESWL GERNNVLRST EYGRDRDSAA 
    KLLTKHKTIE LELDTYSGIV TEMGHSCAAM VAANHPDSKV LAAKQQLIEK MLKSLHKLAS 
    QRQGRLMESL YKHEYFLESD EVEQWIREQE QAASSEDYGQ DFEHLQLLQN KFDDLKHRVE 
    VGADRVDQCE LLAKKLIDSE SPYANEVEKR QEQLRTSWEN LLQLLNQREQ KLHAAGEIHR 
    FHRDVAEALF RIQDKNAALS QELGRDLNSA LALLRKHEGF ENDLVALEAQ LQVLVEDSVR 
    LQAKYPSNAS AIAQQQDKVV AAWNDLKERS TARGDRLAAS SDLQTFLTDV RDIVSWSSNL 
    RAALQAEEHV SDAAGATALK IQHDAIYGEI EAREDKFRYL NELSDSMVQT GHYAAADVEE 
    KCAAMLDERQ KLHAAWNKKK IMLEQKIDLF CFLRDAKQID NLSSSQQAAL SSSDFGQTVE 
    DVQNKIRKHD EFERLIQTQE EKVSLLQEHG RKLIEQRHYD SANIQTILQG VLARRQKVKD 
    LCAVRRYKLE DALLYAKFVR DCAEAKYWIN EKQKKLEADA ASYAEVTNLD EKIKKLQKHQ 
    AFQAEVAANQ GRIQEIQDTG VILLSKQHES SPEIKRAIEI VLEAWQGLLA ELEQRGRGLE 
    EAQDSLEFNS QLDKIEAWIR DKEMMVQASD TGRDLEHCNA LMRKLDDVDS DMRVDDQRVK 
    HINQLADKLI NQAQVPADTQ SVDKRRKDFN YNWRQLQGAL NAYRALLGGA NEIHVFNRDV 
    DDTADRIAEK SLAMSSTDTG RDLAAVEALI RREEALERDM SAVKQKIDQH ETAAEFLIKK 
    YPERGAQHIE RKLEELHKSW GNLQALSVKR QSILNEAYLA HKFVSDVKEL ELWVNDMIKK 
    MNNTQSPSTI NDCETQLELH QERKVEIEGR QEAFAGLKQQ GEQLSKRPQQ QQPDNVRKYL 
    LVLEELHQTL NEAWSERARD LTEAHQLQLF KAQVEQVEIW LANKEAFLNN DDLGDSYTAV 
    ERLLKKHDEF EKLLHADHVD TLQKFANSIL EGEPKDADLI REKLAYILRR KQKLLELSEE 
    RKQRLTQSHQ LQEFLRSLYE IDRWLVQKLQ VALDENYREP SNLQSKIQKH AAFDAELLSN 
    SPRVQSVIHE GERLIRGDHF AKDEIAQQVQ LLEGDWLKLK GASQTKKDKL QQAYDALAFN 
    RSVDEFNNWM DEVELQLSSE DYGKDLAAVS NLLKKHERLE ADVAHHGELA DQLKQKDEQF 
    FQAEHFLRHE IHERATVSIR RYNTLHEPLG IRRENLEDSL SLQQFLRDAE DELQWLAEKQ 
    LVAGSQDLGT SLLSVQGLQK KHNSLEAELT SQEPLIQALL QRGQQMIRDN HFASEQLQYK 
    SELLQKQLVQ LRDLAAIRRL RLLDAVESQL FYVEANEADA WMREKRPVLS SSDYGRDEVS 
    VQGHQKKLEV LQRELTAFKP SIEKVAKLAT GLIERNHFDS SNIAEKNAQV GQEYEDLLRL 
    AKERESRLGE CKKLFEYLRE TEELHEWVGD QMAVTASEDY GEDVEHVEQL ILAFESFVSN 
    LNANEARVEA CLERGDRLIQ ENNPYRSSIK SKRDETKQLW EELKDLVHAR QDALAGAKQV 
    HVYDRVADET IQLINEKDAS LISEDYGQDL ESIQALGRKH QVFESELVGI QGQVDSVLAE 
    AAKLGEIYPD AKEHIEVKRD ETVEAWTDLK EKTAARKNKL SQAEQLQSYF DEYRDLIAWI 
    NEMLAKITAP ELANSVAGAE LLLASTKDHD TEIRTRDETF AKFAANGQQL IKEKHFLAHE 
    VEDKIKVLQA RHELLKHTLN KRREIYELNL DTQLFLKDAE ILEQWISSRE PQLKDTKLGD 
    SIPQVEDLLR RHEDFEKTVA AQEEKFQAIK RITMLEQLFR HQLEQEKISK LQEKERLEKE 
    RLEQLKQREL QRLADERRRA EKQHEHRQNA ASQEKTPIFS SPMVTPAQTS GPQSPALSQV 
    QLRPPFGDDN EHLALQKSSS SGMFGDRLRR GSADANVKRA ESMKVQPKQA KRTPSFTTRR 
    RAQSFRKNQK GEGFDLPPVE IQGSLERKHG LQSGGKKAPV RSWKQFHTVL CGQLVCFFKD 
    ENDFLQQKTA TAPVNILGAK CERADDYTKK KYVFRLKLPD GSEFLFEAPS LDILNDWVRK 
    ISFHASLPPN MQLLSYDESM KQQSSSSPDI KVTSSVESPV SSRNSSPDSQ RRTSGAQVLD 
    GTATPQMAFL QRQMQQQQQQ QQSQPSSPTG GFDQKPPIPP RGAAPVASHR QSQENLVVMR 
    NRQSSNDLQQ SATLPAGLTG VQQNGNGKDD NALLTRNSEA RQSDNPPPLP TTMPPVGGQH 
    QHPQNSHSHQ NQHQAQVQQR INAFNAAASQ QHQPDYFNNN TARQQPQRIP SGRIDSTRKF 
    IEMEAHNNNG GTSSSPKRST INYSSSGASS NGNGNVKIGS GNSSTTTITT STTTHQVTSS 
    SRTVWHLTSS PTSSTKSSST GGSGEPSHAI SNPSYMGLHL NNNNDSIGIG LGGWGNTRFE 
    SNRPVSLQPD SISFSRVSAE SSSESEAQSI SSVSGVKGSK GTKEERRSGM FRIFGRKGDK 
    EKEKDKDKRR SSQVPPQ

Genular Protein ID: 2946047316

Symbol: Q7KV70_DROME

Name: N/A

UniProtKB Accession Codes:

Database IDs:

Citations:

PubMed ID: 10731132

Title: The genome sequence of Drosophila melanogaster.

PubMed ID: 10731132

DOI: 10.1126/science.287.5461.2185

PubMed ID: 12537568

Title: Finishing a whole-genome shotgun: release 3 of the Drosophila melanogaster euchromatic genome sequence.

PubMed ID: 12537568

PubMed ID: 12537572

Title: Annotation of the Drosophila melanogaster euchromatic genome: a systematic review.

PubMed ID: 12537572

DOI: 10.1186/gb-2002-3-12-research0083

PubMed ID: 12537573

Title: The transposable elements of the Drosophila melanogaster euchromatin: a genomics perspective.

PubMed ID: 12537573

PubMed ID: 12537574

Title: Heterochromatic sequences in a Drosophila whole-genome shotgun assembly.

PubMed ID: 12537574

PubMed ID: 16110336

Title: Combined evidence annotation of transposable elements in genome sequences.

PubMed ID: 16110336

DOI: 10.1371/journal.pcbi.0010022

PubMed ID: 17569856

Title: The Release 5.1 annotation of Drosophila melanogaster heterochromatin.

PubMed ID: 17569856

DOI: 10.1126/science.1139815

PubMed ID: 17569867

Title: Sequence finishing and mapping of Drosophila melanogaster heterochromatin.

PubMed ID: 17569867

DOI: 10.1126/science.1139816

PubMed ID: 26109357

Title: Gene Model Annotations for Drosophila melanogaster: Impact of High-Throughput Data.

PubMed ID: 26109357

DOI: .1534/g3.115.018929

PubMed ID: 26109356

Title: Gene Model Annotations for Drosophila melanogaster: The Rule-Benders.

PubMed ID: 26109356

DOI: .1534/g3.115.018937

PubMed ID: 25589440

Title: The Release 6 reference sequence of the Drosophila melanogaster genome.

PubMed ID: 25589440

Sequence Information:

  • Length: 4207
  • Mass: 483686
  • Checksum: 19796638B789D209
  • Sequence:
  • MTQRDGIIKF ENERIKTLQE ERLHIQKKTF TKWMNSFLIK AKMEVEDLFT DLADGIKLLK 
    LLEIISSEKL GKPNSGRMRV HKIENVNKSL AFLHTKVRLE SIGAEDIVDG NPRLILGLIW 
    TIILRFQIQE IEIDVDEENE SSEKRSAKDA LLLWCQRKTH GYPGVNITDF TNSWRSGLGF 
    NALIHSHRPD LFEYSTIVNS KNSNLDNLNH AFDTAANELG IPSLLDAEDI DSARPDEKSI 
    LTYVASYYHT FARMKNEQKS GKRIANIVGQ LMDADRKKMQ YEGLTTNLLS WIRQKTLELE 
    QRDLPNSLEG IQRELLAFKE YRTIEKPPKY KERSEIEALY FTINTLLKAL NQPPYNPQDG 
    QLVNDIEKAW QILEYAEHHR EVALRDELLR QEKLEQLNYK FEKKSVLREG YLKEMIQVLS 
    DPRYLRQVDA TLKKHEAISA DILARVERFN DLTAMAEELD RENYHGKERV RRREQEVMAK 
    WRQLLELLEN QRLNLSQMSN LMNLLREIAS TTEAVRELQQ QFASEDVGPH LLGVEELLQA 
    HSLQELQVNT YGETLKRFNR QALPYKSSEH KDAALLAQRL ADLEEAYSEL LRRSAARRAR 
    LEEARNFHHF MEDYDNEESW LVDKQRICKT GITAKDLRAV LSLQQKHKAL EDEIKSRKPK 
    SGQMSTAGKR LIGEQHPRSS EIQSRIDSLA EHWQALEALV ELRRRQLEDA AEAYQFYTDA 
    NEAESWLNEK IALVNSRDYG NDEPSAQALL QRHRDLQGEL NAYSGDILNL NQQADKLIKA 
    GICTLELSAA EPELPEVEQE EWVNETRLVP KEVWEDEWVE KLEHKKVTET KMLPHVKSLF 
    PFEGQGMKMD KGEVMLLKSK TNDDWWCVRK DNGVEGFVPA NYVREVEPRP VACIVPKAEK 
    VKSLQKVKKT ILVRQVVPVK RIKPVSVAPK PLVQRRTSTQ SINENADSVE KRQQRINQTY 
    DELQEMAQKR HALLEDSIHL FGFYRECDDF EKWMKEKERM IKSDEGEGVD NAKRKFEKFI 
    TDLSAASKRV EEIDGAVDTF RRQGHSQLDK IIARQRQIHQ IWQRLNNAKA QREKSLEGAS 
    SVELFNRTCD EAKVWMSEKM LQLDTAVITP DLRTVQALQR RHQNLERELA PVEDKVNRVT 
    YLGNSVKNAY PAEKDNVNAR QQEVQDMWQQ VQQRGSDLRN RIESEVGQQV FNNSAKVLLA 
    WIDSVKDQLN ADESARDVET ANNLLKKHND LGDDIRAHDT EFVEVIQLGK QLSDGKPNMA 
    ETVAVIERLK AEQDAIHRGW AEKQKWLLQC VDLQMFNREA DKIDATTKSH EAFLEYNNLG 
    ASLDEVEAIL KRHLDFEKSL MAQDKILKGF SDNADKLISN DHYDSKYIGD RRNQVLGKRK 
    AVKDRAFERK RLLQASKDFH KFAAEADDLK VWLQDKTRIA GDENYRDLSN LPRKLQKHQA 
    FERELRANEG QLRNVTKDGQ ALVQAGNRVP EVESRVADLN KRWKDLLTLS EDKGRKLEQA 
    ASQREHNRSL EDAKKKVDEL DSALRSGDVG NDLRSCKDLI NKQQILESEI TIWDQKVAEL 
    VSTGDDMAHG GHFNAQNIEA GTKELQQRFK DLRDPTQRRR AKLEESLNYH KFVFELDSEF 
    QWINEHLPAA KSNELGQNLH QAQSLHKKHK KLEAEIKGHQ PMINKALVAG QSLISQQHPE 
    REQVESLCQQ LEQAWQDLER HCGERSRKLD MSLKAQQYLF DAGEIESWLG ERNNVLRSTE 
    YGRDRDSAAK LLTKHKTIEL ELDTYSGIVT EMGHSCAAMV AANHPDSKVL AAKQQLIEKM 
    LKSLHKLASQ RQGRLMESLY KHEYFLESDE VEQWIREQEQ AASSEDYGQD FEHLQLLQNK 
    FDDLKHRVEV GADRVDQCEL LAKKLIDSES PYANEVEKRQ EQLRTSWENL LQLLNQREQK 
    LHAAGEIHRF HRDVAEALFR IQDKNAALSQ ELGRDLNSAL ALLRKHEGFE NDLVALEAQL 
    QVLVEDSVRL QAKYPSNASA IAQQQDKVVA AWNDLKERST ARGDRLAASS DLQTFLTDVR 
    DIVSWSSNLR AALQAEEHVS DAAGATALKI QHDAIYGEIE AREDKFRYLN ELSDSMVQTG 
    HYAAADVEEK CAAMLDERQK LHAAWNKKKI MLEQKIDLFC FLRDAKQIDN LSSSQQAALS 
    SSDFGQTVED VQNKIRKHDE FERLIQTQEE KVSLLQEHGR KLIEQRHYDS ANIQTILQGV 
    LARRQKVKDL CAVRRYKLED ALLYAKFVRD CAEAKYWINE KQKKLEADAA SYAEVTNLDE 
    KIKKLQKHQA FQAEVAANQG RIQEIQDTGV ILLSKQHESS PEIKRAIEIV LEAWQGLLAE 
    LEQRGRGLEE AQDSLEFNSQ LDKIEAWIRD KEMMVQASDT GRDLEHCNAL MRKLDDVDSD 
    MRVDDQRVKH INQLADKLIN QAQVPADTQS VDKRRKDFNY NWRQLQGALN AYRALLGGAN 
    EIHVFNRDVD DTADRIAEKS LAMSSTDTGR DLAAVEALIR REEALERDMS AVKQKIDQHE 
    TAAEFLIKKY PERGAQHIER KLEELHKSWG NLQALSVKRQ SILNEAYLAH KFVSDVKELE 
    LWVNDMIKKM NNTQSPSTIN DCETQLELHQ ERKVEIEGRQ EAFAGLKQQG EQLSKRPQQQ 
    QPDNVRKYLL VLEELHQTLN EAWSERARDL TEAHQLQLFK AQVEQVEIWL ANKEAFLNND 
    DLGDSYTAVE RLLKKHDEFE KLLHADHVDT LQKFANSILE GEPKDADLIR EKLAYILRRK 
    QKLLELSEER KQRLTQSHQL QEFLRSLYEI DRWLVQKLQV ALDENYREPS NLQSKIQKHA 
    AFDAELLSNS PRVQSVIHEG ERLIRGDHFA KDEIAQQVQL LEGDWLKLKG ASQTKKDKLQ 
    QAYDALAFNR SVDEFNNWMD EVELQLSSED YGKDLAAVSN LLKKHERLEA DVAHHGELAD 
    QLKQKDEQFF QAEHFLRHEI HERATVSIRR YNTLHEPLGI RRENLEDSLS LQQFLRDAED 
    ELQWLAEKQL VAGSQDLGTS LLSVQGLQKK HNSLEAELTS QEPLIQALLQ RGQQMIRDNH 
    FASEQLQYKS ELLQKQLVQL RDLAAIRRLR LLDAVESQLF YVEANEADAW MREKRPVLSS 
    SDYGRDEVSV QGHQKKLEVL QRELTAFKPS IEKVAKLATG LIERNHFDSS NIAEKNAQVG 
    QEYEDLLRLA KERESRLGEC KKLFEYLRET EELHEWVGDQ MAVTASEDYG EDVEHVEQLI 
    LAFESFVSNL NANEARVEAC LERGDRLIQE NNPYRSSIKS KRDETKQLWE ELKDLVHARQ 
    DALAGAKQVH VYDRVADETI QLINEKDASL ISEDYGQDLE SIQALGRKHQ VFESELVGIQ 
    GQVDSVLAEA AKLGEIYPDA KEHIEVKRDE TVEAWTDLKE KTAARKNKLS QAEQLQSYFD 
    EYRDLIAWIN EMLAKITAPE LANSVAGAEL LLASTKDHDT EIRTRDETFA KFAANGQQLI 
    KEKHFLAHEV EDKIKVLQAR HELLKHTLNK RREIYELNLD TQLFLKDAEI LEQWISSREP 
    QLKDTKLGDS IPQVEDLLRR HEDFEKTVAA QEEKFQAIKR ITMLEQLFRH QLEQEKISKL 
    QEKERLEKER LEQLKQRELQ RLADERRRAE KQHEHRQNAA SQEKTPIFSS PMVTPAQTSG 
    PQSPALSQVQ LRPPFGDDNE HLALQKSSSS GMFGDRLRRG SADANVKRAE SMKVQPKQAK 
    RTPSFTTRRR AQSFRKNQKG EGFDLPPVEI QGSLERKHGL QSGGKKAPVR SWKQFHTVLC 
    GQLVCFFKDE NDFLQQKTAT APVNILGAKC ERADDYTKKK YVFRLKLPDG SEFLFEAPSL 
    DILNDWVRKI SFHASLPPNM QLLSYDESMK QQSSSSPDIK VTSSVESPVS SRNSSPDSQR 
    RTSGAQVLDG TATPQMAFLQ RQMQQQQQQQ QSQPSSPTGG FDQKPPIPPR GAAPVASHRQ 
    SQENLVVMRN RQSSNDLQQS ATLPAGLTGV QQNGNGKDDN ALLTRNSEAR QSDNPPPLPT 
    TMPPVGGQHQ HPQNSHSHQN QHQAQVQQRI NAFNAAASQQ HQPDYFNNNT ARQQPQRIPS 
    GRIDSTRKFI EMEAHNNNGG TSSSPKRSTI NYSSSGASSN GNGWGNTRFE SNRPVSLQPD 
    SISFSRVSAE SSSESEAQSI SSVSGVKGSK GTKEERRSGM FRIFGRKGDK EKEKDKDKRR 
    SSQVPPQ

Genular Protein ID: 3128966939

Symbol: Q7KV69_DROME

Name: N/A

UniProtKB Accession Codes:

Database IDs:

Citations:

PubMed ID: 10731132

Title: The genome sequence of Drosophila melanogaster.

PubMed ID: 10731132

DOI: 10.1126/science.287.5461.2185

PubMed ID: 12537568

Title: Finishing a whole-genome shotgun: release 3 of the Drosophila melanogaster euchromatic genome sequence.

PubMed ID: 12537568

PubMed ID: 12537572

Title: Annotation of the Drosophila melanogaster euchromatic genome: a systematic review.

PubMed ID: 12537572

DOI: 10.1186/gb-2002-3-12-research0083

PubMed ID: 12537573

Title: The transposable elements of the Drosophila melanogaster euchromatin: a genomics perspective.

PubMed ID: 12537573

PubMed ID: 12537574

Title: Heterochromatic sequences in a Drosophila whole-genome shotgun assembly.

PubMed ID: 12537574

PubMed ID: 16110336

Title: Combined evidence annotation of transposable elements in genome sequences.

PubMed ID: 16110336

DOI: 10.1371/journal.pcbi.0010022

PubMed ID: 17569856

Title: The Release 5.1 annotation of Drosophila melanogaster heterochromatin.

PubMed ID: 17569856

DOI: 10.1126/science.1139815

PubMed ID: 17569867

Title: Sequence finishing and mapping of Drosophila melanogaster heterochromatin.

PubMed ID: 17569867

DOI: 10.1126/science.1139816

Sequence Information:

  • Length: 4118
  • Mass: 474048
  • Checksum: A5018507590074E0
  • Sequence:
  • MTQRDGIIKF ENERIKTLQE ERLHIQKKTF TKWMNSFLIK AKMEVEDLFT DLADGIKLLK 
    LLEIISSEKL GKPNSGRMRV HKIENVNKSL AFLHTKVRLE SIGAEDIVDG NPRLILGLIW 
    TIILRFQIQE IEIDVDEENE SSEKRSAKDA LLLWCQRKTH GYPGVNITDF TNSWRSGLGF 
    NALIHSHRPD LFEYSTIVNS KNSNLDNLNH AFDTAANELG IPSLLDAEDI DSARPDEKSI 
    LTYVASYYHT FARMKNEQKS GKRIANIVGQ LMDADRKKMQ YEGLTTNLLS WIRQKTLELE 
    QRDLPNSLEG IQRELLAFKE YRTIEKPPKY KERSEIEALY FTINTLLKAL NQPPYNPQDG 
    QLVNDIEKAW QILEYAEHHR EVALRDELLR QEKLEQLNYK FEKKSVLREG YLKEMIQVLS 
    DPRYLRQVDA TLKKHEAISA DILARVERFN DLTAMAEELD RENYHGKERV RRREQEVMAK 
    WRQLLELLEN QRLNLSQMSN LMNLLREIAS TTEAVRELQQ QFASEDVGPH LLGVEELLQA 
    HSLQELQVNT YGETLKRFNR QALPYKSSEH KDAALLAQRL ADLEEAYSEL LRRSAARRAR 
    LEEARNFHHF MEDYDNEESW LVDKQRICKT GITAKDLRAV LSLQQKHKAL EDEIKSRKPK 
    SGQMSTAGKR LIGEQHPRSS EIQSRIDSLA EHWQALEALV ELRRRQLEDA AEAYQFYTDA 
    NEAESWLNEK IALVNSRDYG NDEPSAQALL QRHRDLQGEL NAYSGDILNL NQQADKLIKA 
    GICTLELSAA EPELPEVEQE EWVNETRLVP KEVWEDEWVE KLEHKKVTET KMLPHVKSLF 
    PFEGQGMKMD KGEVMLLKSK TNDDWWCVRK DNGVEGFVPA NYVREVEPRP VACIVPKAEK 
    VKSLQKVKKT ILVRQVVPVK RIKPVSVAPK PLVQRRTSTQ SINENADSVE KRQQRINQTY 
    DELQEMAQKR HALLEDSIHL FGFYRECDDF EKWMKEKERM IKSDEGEGVD NAKRKFEKFI 
    TDLSAASKRV EEIDGAVDTF RRQGHSQLDK IIARQRQIHQ IWQRLNNAKA QREKSLEGAS 
    SVELFNRTCD EAKVWMSEKM LQLDTAVITP DLRTVQALQR RHQNLERELA PVEDKVNRVT 
    YLGNSVKNAY PAEKDNVNAR QQEVQDMWQQ VQQRGSDLRN RIESEVGQQV FNNSAKVLLA 
    WIDSVKDQLN ADESARDVET ANNLLKKHND LGDDIRAHDT EFVEVIQLGK QLSDGKPNMA 
    ETVAVIERLK AEQDAIHRGW AEKQKWLLQC VDLQMFNREA DKIDATTKSH EAFLEYNNLG 
    ASLDEVEAIL KRHLDFEKSL MAQDKILKGF SDNADKLISN DHYDSKYIGD RRNQVLGKRK 
    AVKDRAFERK RLLQASKDFH KFAAEADDLK VWLQDKTRIA GDENYRDLSN LPRKLQKHQA 
    FERELRANEG QLRNVTKDGQ ALVQAGNRVP EVESRVADLN KRWKDLLTLS EDKGRKLEQA 
    ASQREHNRSL EDAKKKVDEL DSALRSGDVG NDLRSCKDLI NKQQILESEI TIWDQKVAEL 
    VSTGDDMAHG GHFNAQNIEA GTKELQQRFK DLRDPTQRRR AKLEESLNYH KFVFELDSEF 
    QWINEHLPAA KSNELGQNLH QAQSLHKKHK KLEAEIKGHQ PMINKALVAG QSLISQQHPE 
    REQVESLCQQ LEQAWQDLER HCGERSRKLD MSLKAQQYLF DAGEIESWLG ERNNVLRSTE 
    YGRDRDSAAK LLTKHKTIEL ELDTYSGIVT EMGHSCAAMV AANHPDSKVL AAKQQLIEKM 
    LKSLHKLASQ RQGRLMESLY KHEYFLESDE VEQWIREQEQ AASSEDYGQD FEHLQLLQNK 
    FDDLKHRVEV GADRVDQCEL LAKKLIDSES PYANEVEKRQ EQLRTSWENL LQLLNQREQK 
    LHAAGEIHRF HRDVAEALFR IQDKNAALSQ ELGRDLNSAL ALLRKHEGFE NDLVALEAQL 
    QVLVEDSVRL QAKYPSNASA IAQQQDKVVA AWNDLKERST ARGDRLAASS DLQTFLTDVR 
    DIVSWSSNLR AALQAEEHVS DAAGATALKI QHDAIYGEIE AREDKFRYLN ELSDSMVQTG 
    HYAAADVEEK CAAMLDERQK LHAAWNKKKI MLEQKIDLFC FLRDAKQIDN LSSSQQAALS 
    SSDFGQTVED VQNKIRKHDE FERLIQTQEE KVSLLQEHGR KLIEQRHYDS ANIQTILQGV 
    LARRQKVKDL CAVRRYKLED ALLYAKFVRD CAEAKYWINE KQKKLEADAA SYAEVTNLDE 
    KIKKLQKHQA FQAEVAANQG RIQEIQDTGV ILLSKQHESS PEIKRAIEIV LEAWQGLLAE 
    LEQRGRGLEE AQDSLEFNSQ LDKIEAWIRD KEMMVQASDT GRDLEHCNAL MRKLDDVDSD 
    MRVDDQRVKH INQLADKLIN QAQVPADTQS VDKRRKDFNY NWRQLQGALN AYRALLGGAN 
    EIHVFNRDVD DTADRIAEKS LAMSSTDTGR DLAAVEALIR REEALERDMS AVKQKIDQHE 
    TAAEFLIKKY PERGAQHIER KLEELHKSWG NLQALSVKRQ SILNEAYLAH KFVSDVKELE 
    LWVNDMIKKM NNTQSPSTIN DCETQLELHQ ERKVEIEGRQ EAFAGLKQQG EQLSKRPQQQ 
    QPDNVRKYLL VLEELHQTLN EAWSERARDL TEAHQLQLFK AQVEQVEIWL ANKEAFLNND 
    DLGDSYTAVE RLLKKHDEFE KLLHADHVDT LQKFANSILE GEPKDADLIR EKLAYILRRK 
    QKLLELSEER KQRLTQSHQL QEFLRSLYEI DRWLVQKLQV ALDENYREPS NLQSKIQKHA 
    AFDAELLSNS PRVQSVIHEG ERLIRGDHFA KDEIAQQVQL LEGDWLKLKG ASQTKKDKLQ 
    QAYDALAFNR SVDEFNNWMD EVELQLSSED YGKDLAAVSN LLKKHERLEA DVAHHGELAD 
    QLKQKDEQFF QAEHFLRHEI HERATVSIRR YNTLHEPLGI RRENLEDSLS LQQFLRDAED 
    ELQWLAEKQL VAGSQDLGTS LLSVQGLQKK HNSLEAELTS QEPLIQALLQ RGQQMIRDNH 
    FASEQLQYKS ELLQKQLVQL RDLAAIRRLR LLDAVESQLF YVEANEADAW MREKRPVLSS 
    SDYGRDEVSV QGHQKKLEVL QRELTAFKPS IEKVAKLATG LIERNHFDSS NIAEKNAQVG 
    QEYEDLLRLA KERESRLGEC KKLFEYLRET EELHEWVGDQ MAVTASEDYG EDVEHVEQLI 
    LAFESFVSNL NANEARVEAC LERGDRLIQE NNPYRSSIKS KRDETKQLWE ELKDLVHARQ 
    DALAGAKQVH VYDRVADETI QLINEKDASL ISEDYGQDLE SIQALGRKHQ VFESELVGIQ 
    GQVDSVLAEA AKLGEIYPDA KEHIEVKRDE TVEAWTDLKE KTAARKNKLS QAEQLQSYFD 
    EYRDLIAWIN EMLAKITAPE LANSVAGAEL LLASTKDHDT EIRTRDETFA KFAANGQQLI 
    KEKHFLAHEV EDKIKVLQAR HELLKHTLNK RREIYELNLD TQLFLKDAEI LEQWISSREP 
    QLKDTKLGDS IPQVEDLLRR HEDFEKTVAA QEEKFQAIKR ITMLEQLFRH QLEQEKISKL 
    QEKERLEKER LEQLKQRELQ RLADERRRAE KQHEHRQNAA SQEKTPIFSS PMVTPAQTSG 
    PQSPALSQVQ LRPPFGDDNE HLALQKSSSS GMFGDRLRRG SADANVKRAE SMKVQPKQAK 
    RTPSFTTRRR AQSFRKNQKG EGFDLPPVEI QGSLERKHGL QSGGKKAPVR SWKQFHTVLC 
    GQLVCFFKDE NDFLQQKTAT APVNILGAKC ERADDYTKKK YVFRLKLPDG SEFLFEAPSL 
    DILNDWVRKI SFHASLPPNM QLLSYDESMK QQSSSSPDIK VTSSVESPVS SRNSSPDSQR 
    RTSGAQVLDG TATPQMAFLQ RQMQQQQQQQ QSQPSSPTGG FDQKPPIPPR GAAPVASHRQ 
    SQENLVVMRN RQSSNDLQQS ATLPAGLTGV QQNGNGKDDN ALLTRNSEAR QSGSSAFKPV 
    KITRRSYLRT SLQSWGNTRF ESNRPVSLQP DSISFSRVSA ESSSESEAQS ISSVSGVKGS 
    KGTKEERRSG MFRIFGRKGD KEKEKDKDKR RSSQVPPQ

Database document:

This is a preview of the gene's schema. Only a few entries are kept for 'singleCellExpressions,' 'mRNAExpressions,' and other large data arrays for visualization purposes. You can zoom in with the mouse wheel for a closer view, and the text will adjust automatically if necessary. For the full schema, download it here.