Top Important LLM Papers for the Week from 13/11 to 19/11
Stay Relevant to Recent Large Language Models Research
Large language models (LLMs) have advanced rapidly in recent years. As new generations of models are developed, it’s important for researchers and engineers to stay informed on the latest progress. This article summarizes some of the most important LLM papers published during the third week of November.
The papers cover various topics shaping the next generation of language models, from model optimization and scaling to reasoning, benchmarking, and enhancing performance. Keeping up with novel LLM research across these domains will help guide continued progress toward models that are more capable, robust, and aligned with human values.
Table of Contents:
LLM Progress & Benchmarking
LLM Training & Optimization
LLM Fine Tuning
LLM Reasoning
Responsible AI & LLM Ethics
1. LLM Progress & Benchmarking
PEARL: Personalizing Large Language Model Writing Assistants with Generation-Calibrated Retrievers
Mirasol3B: A Multimodal Autoregressive model for time-aligned and contextual modalities
ML-Bench: Large Language Models Leverage Open-source Libraries for Machine Learning Tasks
Llamas Know What GPTs Don’t Show: Surrogate Models for Confidence Estimation
MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models and Tasks
Towards General-Purpose Speech Abilities for Large Language Models Using Unpaired Data
Fast Chain-of-Thought: A Glance of Future from Parallel Decoding Leads to Answers Faster
Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs
The Impact of Large Language Models on Scientific Discovery: a Preliminary Study using GPT-4
Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer
2. LLM Training & Optimization & EvaluationÂ
SelfEval: Leveraging the discriminative nature of generative models for evaluation
Routing to the Expert: Efficient Reward-guided Ensemble of Large Language Models
DiLoCo: Distributed Low-Communication Training of Language Models
Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2
3. LLM Fine Tuning
Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization
Tied-Lora: Enhancing parameter efficiency of LoRA with weight tying
UT5: Pretraining Non autoregressive T5 with unrolled denoising
4. LLM Reasoning
UNcommonsense Reasoning: Abductive Reasoning about Uncommon Situations
ADaPT: As-Needed Decomposition and Planning with Language Models
5. LLM Ethics &Â SafetyÂ
6. Transformers & Attention ModelsÂ
FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores
Hiformer: Heterogeneous Feature Interactions Learning with Transformers for Recommender Systems
7. Prompt EngineeringÂ
Are you looking to start a career in data science and AI and do not know how? I offer data science mentoring sessions and long-term career mentoring:
Mentoring sessions: https://lnkd.in/dXeg3KPW
Long-term mentoring: https://lnkd.in/dtdUYBrM