Close

1. Identity statement
Reference TypeBook Section
Sitemtc-m21b.sid.inpe.br
Holder Codeisadg {BR SPINPE} ibi 8JMKD3MGPCW/3DT298S
Identifier8JMKD3MGP3W34P/3LKJ8DB
Repositorysid.inpe.br/mtc-m21b/2016/05.03.15.42   (restricted access)
Last Update2017:01.02.16.15.59 (UTC) administrator
Metadata Repositorysid.inpe.br/mtc-m21b/2016/05.03.15.42.50
Metadata Last Update2021:01.03.02.53.34 (UTC) administrator
Secondary KeyINPE--/
DOI10.1007/978-3-319-32467-8_64
ISBN978-331932466-1
Citation KeyRamosTaAlAcCuDi:2016:DiSyPe
TitleDistributed systems performance for big data
Year2016
Access Date2024, Apr. 18
Secondary TypePRE LI
Number of Files1
Size1136 KiB
2. Context
Author1 Ramos, Marcelo Paiva
2 Tasinaffo, Paulo Marcelo
3 Almeida, Eugênio Sper de
4 Achite, Luis Marcelo
5 Cunha, Adilson Marques da
6 Dias, Luiz Alberto Vieira
Resume Identifier1
2
3 8JMKD3MGP5W/3C9JH2S
4
5
6 8JMKD3MGP5W/3C9JHML
Group1 SSS-CPT-INPE-MCTI-GOV-BR
2
3 SSS-CPT-INPE-MCTI-GOV-BR
Affiliation1 Instituto Nacional de Pesquisas Espaciais (INPE)
2 Instituto Tecnológico de Aeronáutica (ITA)
3 Instituto Nacional de Pesquisas Espaciais (INPE)
4 Instituto Tecnológico de Aeronáutica (ITA)
5 Instituto Tecnológico de Aeronáutica (ITA)
6 Instituto Tecnológico de Aeronáutica (ITA)
Author e-Mail Address1 marcelopaivaramos@gmail.com
2
3 eugenio.almeida@inpe.br
EditorLafiti, Shahram
Series EditorKacprzyk, Janusz
Book TitleInformation technology: new generations
PublisherSpringer Verlag
Volume448
Pages733-744
Series TitleAdvances in Intelligent Systems and Computing
History (UTC)2016-05-03 15:45:52 :: simone -> administrator :: 2016
2016-07-04 12:30:00 :: administrator -> simone :: 2016
2016-12-21 13:15:34 :: simone -> administrator :: 2016
2021-01-03 02:53:34 :: administrator -> simone :: 2016
3. Content and structure
Is the master or a copy?is the master
Content Stagecompleted
Transferable1
Content TypeExternal Contribution
Version Typepublisher
KeywordsBig data
Climate prediction
Cluster HPC
Distributed systems
Hadoop
Hive
Python
AbstractThis paper describes a methodology for working with distributed systems, and achieve performance in Big Data, through the framework Hadoop, Python programming language, and Apache Hive module. The efficiency of the proposed methodology is tested through a case study that addresses a real problem found in the supercomputing environment of the Center for Weather Forecasting and Climate Studies linked to the Brazilian Institute for Space Research (CPTEC/INPE), which provides Society a work able to predict disasters and save people lives. In all three experiments involving the issue, using the Cray XT-6 supercomputer: (i) the first issue involves programming in Python and a sequential and monoprocessed arquitecture; (ii) the second uses Python and Hadoop framework, over parallel and distributed arquitecture; (iii) the latter combines Hadoop and Hive in a parallel and distributed arquitecture. The main results of these experiments are compared, discussed, and topics beyond the scope in this research are exposed as recommendations and suggestions for future work.
AreaMET
Arrangementurlib.net > BDMCI > Fonds > Produção anterior à 2021 > SESSS > Distributed systems performance...
doc Directory Contentaccess
source Directory Contentthere are no files
agreement Directory Content
agreement.html 03/05/2016 12:42 1.9 KiB 
4. Conditions of access and use
Languageen
Target FileITNG1_DistributedSystemsPerformanceForBigData.pdf
User Groupself-uploading-INPE-MCTI-GOV-BR
simone
Visibilityshown
Read Permissiondeny from all and allow from 150.163
Update Permissionnot transferred
5. Allied materials
Mirror Repositoryurlib.net/www/2011/03.29.20.55
Next Higher Units8JMKD3MGPCW/43SRFME
DisseminationBNDEPOSITOLEGAL
Host Collectionsid.inpe.br/mtc-m21b/2013/09.26.14.25.20
6. Notes
Empty Fieldsarchivingpolicy archivist callnumber city copyholder copyright creatorhistory descriptionlevel e-mailaddress edition format issn label lineage mark nextedition notes numberofvolumes orcid parameterlist parentrepositories previousedition previouslowerunit progress project readergroup rightsholder schedulinginformation secondarydate secondarymark session shorttitle sponsor subject tertiarymark tertiarytype translator url
7. Description control
e-Mail (login)simone
update 


Close