default search action

combined dblp search
author search
venue search
publication search

ask others

Luckeciano Carvalho Melo

Luckeciano C. Melo

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2026
[c7]
- view
  - electronic edition @ ceur-ws.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/lacatoda/LuzBMOOMS26
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/lacatoda/LuzBMOOMS26
Murilo Lopes da Luz, Bruno Brandão, Luana G. B. Martins, Gustavo Oliveira, Bryan L. M. de Oliveira, Luckeciano Carvalho Melo, Telma W. L. Soares:
Partial Reasoning in Language Models: Search and Refinement Guided by Uncertainty. LaCATODA@AAAI 2026: 106-116
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2601-12040
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2601-12040
Murilo Lopes da Luz, Bruno Brandão, Luana G. B. Martins, Gustavo Oliveira, Bryan L. M. de Oliveira, Luckeciano Carvalho Melo, Telma Woerle de Lima:
Partial Reasoning in Language Models: Search and Refinement Guided by Uncertainty. CoRR abs/2601.12040 (2026)
2025
[c6]
- view
- export record
  dblp key:
  - conf/icml/OliveiraMBLSM25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/OliveiraMBLSM25
Bryan Lincoln Marques de Oliveira, Luana Guedes Barros Martins, Bruno Brandão, Murilo Lopes da Luz, Telma Woerle de Lima Soares, Luckeciano Carvalho Melo:
Sliding Puzzles Gym: A Scalable Benchmark for State Representation in Visual Reinforcement Learning. ICML 2025
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-11250
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-11250
Zihuiwen Ye, Luckeciano Carvalho Melo, Younesse Kaddar, Phil Blunsom, Sam Staton, Yarin Gal:
Uncertainty-Aware Step-wise Verification with Generative Reward Models. CoRR abs/2502.11250 (2025)
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-12257
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-12257
Bryan L. M. de Oliveira, Luana G. B. Martins, Bruno Brandão, Luckeciano C. Melo:
InfoQuest: Evaluating Multi-Turn Dialogue Agents for Open-Ended Conversations with Hidden Context. CoRR abs/2502.12257 (2025)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2510-00819
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-00819
Luckeciano C. Melo, Alessandro Abate, Yarin Gal:
Stabilizing Policy Gradients for Sample-Efficient Reinforcement Learning in LLM Reasoning. CoRR abs/2510.00819 (2025)
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2511-03527
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2511-03527
Bryan L. M. de Oliveira, Felipe Vieira Frujeri, Marcos P. C. M. Queiroz, Luana G. B. Martins, Telma W. L. Soares, Luckeciano C. Melo:
Learning Without Critics? Revisiting GRPO in Classical Reinforcement Learning Environments. CoRR abs/2511.03527 (2025)
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2512-24940
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2512-24940
Augusto B. Corrêa, Yoav Gelberg, Luckeciano C. Melo, Ilia Shumailov, André Grahl Pereira, Yarin Gal:
Iterative Deployment Improves Planning Skills in LLMs. CoRR abs/2512.24940 (2025)
2024
[c5]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/MeloTAG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/MeloTAG24
Luckeciano Carvalho Melo, Panagiotis Tigas, Alessandro Abate, Yarin Gal:
Deep Bayesian Active Learning for Preference Modeling in Large Language Models. NeurIPS 2024
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-10023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-10023
Luckeciano C. Melo, Panagiotis Tigas, Alessandro Abate, Yarin Gal:
Deep Bayesian Active Learning for Preference Modeling in Large Language Models. CoRR abs/2406.10023 (2024)
[i7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-07812
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-07812
Luckeciano C. Melo, Alessandro Abate, Yarin Gal:
Temporal-Difference Variational Continual Learning. CoRR abs/2410.07812 (2024)
[i6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-14038
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-14038
Bryan L. M. de Oliveira, Murilo Lopes da Luz, Bruno Brandão, Luana G. B. Martins, Telma Woerle de Lima Soares, Luckeciano C. Melo:
Sliding Puzzles Gym: A Scalable Benchmark for State Representation in Visual Reinforcement Learning. CoRR abs/2410.14038 (2024)
2022
[j2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/access/BrandaoLSMM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/access/BrandaoLSMM22
Bruno Brandão, Telma Woerle de Lima, Anderson Soares, Luckeciano C. Melo, Marcos R. O. A. Máximo:
Multiagent Reinforcement Learning for Strategic Decision Making and Control in Robotic Soccer Through Self-Play. IEEE Access 10: 72628-72642 (2022)
[c4]
- view
  - electronic edition @ mlr.press (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/Melo22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/Melo22
Luckeciano C. Melo:
Transformers are Meta-Reinforcement Learners. ICML 2022: 15340-15359
[i5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-06614
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-06614
Luckeciano C. Melo:
Transformers are Meta-Reinforcement Learners. CoRR abs/2206.06614 (2022)
2021
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/jirs/MeloMM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jirs/MeloMM21
Luckeciano Carvalho Melo, Dicksiano Carvalho Melo, Marcos Ricardo Omena de Albuquerque Máximo:
Learning Humanoid Robot Running Motions with Symmetry Incentive through Proximal Policy Optimization. J. Intell. Robotic Syst. 102(3): 54 (2021)
2020
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/icdm/SantanaMCBSOC20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icdm/SantanaMCBSOC20
Marlesson R. O. Santana, Luckeciano C. Melo, Fernando H. F. Camargo, Bruno Brandão, Anderson Soares, Renan M. Oliveira, Sandor Caetano:
MARS-Gym: A Gym framework to model, train, and evaluate Recommender Systems for Marketplaces. ICDM (Workshops) 2020: 189-197
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/recsys/SantanaMCBSOC20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/recsys/SantanaMCBSOC20
Marlesson R. O. Santana, Luckeciano C. Melo, Fernando H. F. Camargo, Bruno Brandão, Anderson Soares, Renan M. Oliveira, Sandor Caetano:
Contextual Meta-Bandit for Recommender Systems Selection. RecSys 2020: 444-449
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-07035
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-07035
Marlesson R. O. Santana, Luckeciano C. Melo, Fernando H. F. Camargo, Bruno Brandão, Anderson Soares, Renan M. Oliveira, Sandor Caetano:
MARS-Gym: A Gym framework to model, train, and evaluate Recommender Systems for Marketplaces. CoRR abs/2010.07035 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/larc/MeloM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/larc/MeloM19
Luckeciano Carvalho Melo, Marcos Ricardo Omena Albuquerque Máximo:
Learning Humanoid Robot Running Skills through Proximal Policy Optimization. LARS/SBR/WRE 2019: 37-42
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1901-00270
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1901-00270
Luckeciano Carvalho Melo, Marcos Ricardo Omena Albuquerque Máximo, Adilson Marques da Cunha:
Learning Humanoid Robot Motions Through Deep Neural Networks. CoRR abs/1901.00270 (2019)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1910-10232
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-10232
Luckeciano Carvalho Melo, Marcos Ricardo Omena Albuquerque Máximo, Adilson Marques da Cunha:
Bottom-Up Meta-Policy Search. CoRR abs/1910.10232 (2019)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1910-10620
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-10620
Luckeciano Carvalho Melo, Marcos R. O. A. Máximo:
Learning Humanoid Robot Running Skills through Proximal Policy Optimization. CoRR abs/1910.10620 (2019)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.