- Aug 26 22:00:04 2025 Hermes_4_Technical_Report-aug2025-arxiv-2508.18255v1.pdf
- Aug 26 11:53:22 2025 Reinforcement_Learning_with_Rubric_Anchors-aug2025-arxiv-2508.12790v1.pdf
- Aug 26 05:06:38 2025 Jet-Nemotron-Efficient_Language_Model_with_Post_Neural_Architecture_Search-arxiv-2508.15884v1.pdf
- Aug 26 05:04:44 2025 AgentFly-Fine-tuning_LLM_Agents_without_Fine-tuning_LLMs-aug2025-arxiv-2508.16153v1.pdf
- Aug 25 09:30:41 2025 Motif_2.6B_Technical_Report-aug2025-arxiv-2508.09148v1.pdf
- Aug 24 08:54:55 2025 DEEP_THINK_WITH_CONFIDENCE-aug2025-arxiv-2508.15260v1.pdf
- Aug 24 08:51:44 2025 Guiding_an_Automatic_Speech_Recognition_Decoder_using_Large_Language_Models-aug2025-arxiv-2508.02228v1.pdf
- Aug 24 08:11:39 2025 Intern-S1-A_Scientific_Multimodal_Foundation_Model-aug2025-arxiv-2508.15763v1.pdf
- Aug 23 20:42:23 2025 HIRAG-Hierarchical-Thought_Instruction-Tuning_Retrieval-Augmented_Generation-jul2025-arxiv-2507.05714v2.pdf
- Aug 22 16:30:07 2025 Matrix_Calculus_for_Machine_Learning_and_Beyond-jan2025-arxiv-2501.14787v1.pdf
- Aug 22 11:49:03 2025 Retrospective_Sparse_Attention_for_Efficient_Long-Context_Generation-aug2025-arxiv-2508.09001v1.pdf
- Aug 21 23:08:53 2025 XQUANT-Breaking_the_Memory_Wall_for_LLM_Inference_with_KV_Cache_Rematerialization-aug2025-arxiv-2508.10395v1.pdf
- Aug 20 12:02:44 2025 MDPO-OVERCOMING_THE_TRAINING-INFERENCE_DIVIDE_OF_MASKED_DIFFUSION_LANGUAGE_MODELS-aug2025-arxiv-2508.13148v1.pdf
- Aug 20 08:01:15 2025 BeyondWeb-Lessons_from_Scaling_Synthetic_Data_for_Trillion-scale_Pretraining-aug2025-arxiv-2508.10975v1.pdf
- Aug 18 23:28:18 2025 Apriel-Nemotron-15B-Thinker-aug2025-arxiv-2508.10948v1.pdf
- Aug 12 15:53:36 2025 Grove_MoE-_Towards_Efficient_and_Superior_MoE_LLMs_with_Adjugate_Experts-aug2025-arxiv-2508.07785v1.pdf
- Aug 12 09:43:35 2025 Part_I-Tricks_or_Traps_A_Deep_Dive_into_RL_for_LLM_Reasoning-aug2025-arxiv-2508.08221v1.pdf
- Aug 11 07:00:34 2025 GLM-4.5_Agentic_Reasoning_and_Coding_ARC_Foundation_Models -aug2025-arxiv-2508.06471v1.pdf
- Aug 10 10:49:47 2025 Let_the_Expert_Stick_to_His_Last-Expert-Specialized_Fine-Tuning_for_Sparse_Architectural_LLMs-jul2024-arxiv-2407.01906v2.pdf
- Aug 9 18:31:03 2025 Training-Free_Long-Context_Scaling_of_Large_Language_Models-feb2024-arxiv-2402.17463v2.pdf
- Aug 9 18:29:45 2025 MInference_1.0-Accelerating_Pre-filling_for_Long-Context_LLMs_via_Dynamic_Sparse_Attention-jul2024-arxiv-2407.02490v2.pdf
- Aug 9 18:28:56 2025 Qwen2.5-1M_Technical_Report-jan2025-arxiv-2501.15383v1.pdf
- Aug 8 23:53:48 2025 ON_THE_GENERALIZATION_OF_SFT-A_REINFORCEMENT_LEARNING_PERSPECTIVE_WITH_REWARD_RECTIFICATION-aug2025-arxiv-2508.05629v1.pdf
- Aug 8 14:23:11 2025 R-Zero-Self-Evolving_Reasoning_LLM_from_Zero_Data-aug2025-arxiv-2508.05004v1.pdf
- Aug 8 07:23:57 2025 Multi-module_GRPO-Composing_Policy_Gradients_and_Prompt_Optimization_for_Language_Model_Programs-aug2025-arxiv-2508.04660v1.pdf
- Aug 7 13:26:21 2025 Learning_Formal_Mathematics_From_Intrinsic_Motivation-aug2025-arxiv-2407.00695v2.pdf
- Aug 7 13:21:25 2025 Assessing_Adaptive_World_Models_in_Machines_with_Novel_Games-jul2025-arxiv-2507.12821v2.pdf
- Aug 5 10:00:20 2025 Unifying_Mixture_of_Experts_and_Multi-Head_Latent_Attention_for_Efficient_Language_Models-aug2025-arxiv-2508.01261v1.pdf
- Aug 3 21:29:39 2025 UloRL-An_Ultra-Long_Output_RL_Approach_for_Advancing_LLM_Reasoning_Abilities-jul2025-arxiv-2507.19766v1.pdf
- Aug 2 15:28:50 2025 Test-Time_Scaling_with_Reflective_Generative_Model-jul2025-arxiv-2507.01951v2.pdf
- Aug 2 07:02:58 2025 Titans-Learning_to_Memorize_at_Test_Time-dec2024-arxiv-2501.00663v1.pdf
- Jul 31 15:01:09 2025 GEPA-Reflective_Prompt_Evolution_Can_Outperform_Reinforcement_Learning-jul2025-arxiv-2507.19457v1.pdf
- Jul 30 09:45:14 2025 A_Survey_of_Self-Evolving_Agents-_On_Path_to_Artificial_Super_Intelligence-jul2025-arxiv-2507.21046v1.pdf
- Jul 30 02:39:33 2025 What_Lives-A_meta-analysis_of_diverse_opinions_on_the_definition_of_life-may2025-arxiv-2505.15849v1.pdf
- Jul 29 10:48:21 2025 QWEN-Group_Sequence_Policy_Optimization-jul2025-arxiv-2507.18071v2.pdf
- Jul 28 06:59:39 2025 KAT-V1- Kwai-AutoThink-Technical_Report-jul2025-arxiv-2507.08297v3.pdf
- Jul 25 14:27:02 2025 QWEN-Group_Sequence_Policy_Optimization-jul2025-arxiv-2507.18071v1.pdf
- Jul 25 10:09:45 2025 Learning_without_training-The_implicit_dynamics_of_in-context_learning-jul2025-arxiv-2507.16003v1.pdf
- Jul 25 08:23:09 2025 AlphaGo_Moment_for_Model_Architecture_Discovery-jul2025-arxiv-2507.18074v1.pdf
- Jul 23 17:30:03 2025 Deep_Researcher_with_Test-Time_Diffusion-jul2025-arxiv-2507.16075v1.pdf
- Jul 23 08:32:48 2025 LADDER- SELF-IMPROVING_LLMS_THROUGH_RECURSIVE_PROBLEM_DECOMPOSITION-mar2025-arxiv-2503.00735v3.pdf
- Jul 15 16:34:47 2025 TABM-ADVANCING_TABULAR_DEEP_LEARNING_WITH_PARAMETER-EFFICIENT_ENSEMBLING-feb2025-arxiv-2410.24210v3.pdf
- Jul 12 12:57:41 2025 Step-by-Step_Diffusion-An_Elementary_Tutorial-jun2024-arxiv-2406.08929v2.pdf
- Jul 9 11:24:45 2025 MemAgent_Reshaping_Long-Context_LLM_with_Multi-Conv_RL-based_-Memory_Agent-jul2025-arxiv-2507.02259v1.pdf
- Jul 7 10:16:04 2025 LDP-Learning_Long-Context_Diffusion_Policies_via_Past-Token_Prediction-may2025-arxiv-2505.09561v2.pdf
- Jul 7 09:02:01 2025 EBT-Energy-Based_Transformers_are_Scalable_Learners_and_Thinkers-jul2025-arxiv-2507.02092v1.pdf
- Jul 6 16:07:15 2025 Darwin_Godel_Machine_Open-Ended_Evolution_of_Self-Improving_Agents-may2025-arxiv-2505.22954v1.pdf
- Jul 6 11:11:17 2025 SEAL-Self-Adapting_Language_Models-jun2025-arxiv-2506.10943v1.pdf
- Jul 4 20:32:58 2025 Steering_Your_Diffusion_Policy_with_Latent_Space_Reinforcement_Learning-jun2025-arxiv-2506.15799v2.pdf
- Jul 4 00:47:22 2025 HRM-Hierarchical_Reasoning_Model-jun2025-arxiv-2506.21734v1.pdf
- Jun 24 00:17:56 2025 TabDPT_Scaling_Tabular_Foundation_Models-oct2024-arxiv-2410.18164v1.pdf
- Jun 24 00:08:28 2025 TabArena-A_Living_Benchmark_for_Machine_Learning_on_Tabular_Data-jun2025-arxiv-2506.16791v1.pdf
- Jun 23 16:43:39 2025 REASONING_WITH_EXPLORATION-AN_ENTROPY_PERSPECTIVE-jun2025-arxiv-2506.14758v1.pdf
- Jun 14 15:19:36 2025 Self-Adapting_Language-Models-SEAL-jun2025-arxiv-2506.10943v1.pdf
- Jun 2 02:11:10 2025 Learning_to_Reason_with_External_Rewards-may2025-arxiv-2505.19590v1.pdf
- May 23 09:13:07 2025 What_Lives_A_meta_analysis_of_diverse_opinions_on_the_definition_of_life-may2025-arxiv-2505.15849v1.pdf
- May 21 20:30:11 2025 Large_Language_Diffusion_Models-feb2025-arxiv-2502.09992v2.pdf
- May 21 11:07:41 2025 Insights_into_DeepSeek-V3-Scaling_Challenges_and_Reflections_on_Hardware_for_AI_Architectures-may2025-arxiv-2505.09343v1.pdf
- May 10 21:51:44 2025 Absolute_Zero-Reinforced_Self-play_Reasoning_with_Zero_Data-may2025-arxiv-2505.03335v2.pdf
- May 7 08:31:59 2025 Absolute_Zero_Reinforced_Self-play_Reasoning_with_Zero_Data-may2025-arxiv-2505.03335v1.pdf
- May 5 22:17:25 2025 Bytedance-Monolith_Real_Time_Recommendation_System_With_Collisionless_Embedding_Table-sep2022-arxiv-2209.07663v2.pdf
- May 3 18:53:59 2025 CODE_IO_Condensing_Reasoning_Patterns_via_Code_Input-Output_Prediction-feb2025-arxiv-2502.07316v2.pdf
- Apr 26 20:24:50 2025 Bahdanu-Cho-Bengio-NEURAL_MACHINE_TRANSLATION_BY_JOINTLY_LEARNING_TO_ALIGN_AND_TRANSLATE-2015-arxiv-1409.0473v7.pdf
- Apr 18 19:28:13 2025 Murphy-Reinforcement_Learning_A_Comprehensive_Overview-mar2025-arxiv-2412.05265v2.pdf
- Apr 2 10:17:48 2025 Command_A-An_Enterprise-Ready Large_Language_Model-arxiv-apr2025-2504.00698v1.pdf
- Apr 2 09:43:44 2025 The_Information_Theory_of_Individuality-arxiv-dec2014-1412.2447v1.pdf
- Apr 1 21:13:40 2025 Lecture_Notes_on_High-Dimensional_Data-arxiv-sep2024-2101.05841v7.pdf
- Mar 31 21:38:34 2025 Kafri-The_Second_Law_and_Informatics-arxiv-2006-0701016v2.pdf
- Mar 27 15:28:43 2025 Qwen2.5-Omni_Technical_Report-arxiv-mar2025-2503.20215v1.pdf
- Mar 27 11:19:25 2025 AGI_Governments_and_Free_Societies-arxiv-feb2025-2503.05710v2.pdf
- Mar 26 10:28:05 2025 Ryan_Williams-Simulating_Time_With_Square-Root_Space-arxiv-feb2025-2502.17779v1.pdf
- Mar 25 18:11:00 2025 A_THEORY_OF_USABLE_INFORMATION_UNDER_COMPUTATIONAL_CONSTRAINTS-2020-arxiv-2002.10689v1.pdf
- Mar 25 08:10:54 2025 DeepSeek-V3_Technical_Report-feb2025-arxiv-2412.19437v2.pdf
- Mar 12 21:27:26 2025 Generalized_Kullback-Leibler_Divergence_Loss-arxiv-mar2025-2503.08038v1.pdf
- Mar 10 14:18:06 2025 Probabilistic_Artificial_Intelligence-arxiv-mar2025-2502.05244v1.pdf
- Jan 17 17:35:52 2025 Foundations_of_Large_Language_Models-jan2025-arxiv-2501.09223v1.pdf
- Jan 15 09:33:18 2025 The_AI_Scientist_Towards_Fully_Automated_Open-Ended_Scientific_Discovery-sep2024-arxiv-2408.06292v3.pdf
- Jan 13 11:48:27 2025 Samba-ASR_State-Of-The-Art_Speech_Recognition_Leveraging_structured_State-Space_Models-arxiv-jan2025-2501.02832v3.pdf
- Dec 29 16:02:16 2024 Scaling_of_Search_and_Learning_A_Roadmap_to_Reproduce_o1_from_Reinforcement_Learning_Perspective-arxiv-2412.14135v1-dec2024.pdf
- Dec 18 11:19:31 2024 Scaling_LLM_Test-Time_Compute_Optimally_can_be_More_Effective_than_Scaling_Model_Parameters-aug2024-arxiv-2408.03314v1.pdf
- Dec 18 11:17:58 2024 The_Unbearable_Slowness_of_Being_Why_do_we-live_at_10_bits_per-sec-dec2024-arxiv-2408.10234v2.pdf
- Dec 17 17:44:44 2024 THE_COMPLEXITY_DYNAMICS_OF_GROKKING-dec2024-arxiv-2412.09810v1.pdf
- Dec 12 08:33:57 2024 MASK_is_All_You_Need-2024-arxiv-2412.06787v2.pdf
- Dec 9 09:15:49 2024 Reinforcement_Learning_An_Overview-arxiv-2412.05265v1-2024.pdf
- Dec 2 09:24:54 2024 Mixture_of_A_Million_Experts-arxiv-2407.04153v1-2024.pdf
- Nov 26 17:10:04 2024 Understanding_LLM_Embeddings_for_Regression-arxiv-2411.14708v1.pdf
- Nov 14 20:30:27 2024 Neural_Machine_Translation_by_Jointly_Learning_to_Align_and_Translate-arxiv-1409.0473v7.pdf
- Nov 14 20:29:09 2024 Sequence_to_Sequence_Learning_with_Neural_Networks-arxiv-1409.3215v3.pdf
- Nov 13 09:42:22 2024 TabM_Advancing_Tabular_Deep_Learning-arxiv-2410.24210v2.pdf
- Nov 8 16:47:29 2024 Denoising_Diffusion_Probabilistic_Models_in_Six_Simple_Steps-arxiv-2402.04384v2.pdf
- Nov 6 10:45:32 2024 A_closed-form_expression_for_the_Sharma-Mittal_entropy_of_exponential_families-arxiv-1112.4221v1.pdf
- Oct 30 08:49:35 2024 Beyond_Autoregression_Discrete_Diffusion_for_Complex_Reasoning_and_Planning_-_arxiv-2410.14157v1.pdf
- Oct 28 15:22:36 2024 Patrick_Kidger_-_On_Neural_Differential_Equations_-_PhD_thesis_2021_-_arxiv-2202.02435v1.pdf
- Oct 28 13:32:57 2024 A_First_Course_in_Monte_Carlo_Methods-arxiv-2405.16359v1.pdf
- Oct 12 22:41:13 2024 Lets_Verify_Step_by_Step-OpenAI-2023-arxiv-2305.20050v1.pdf
- Oct 12 22:18:06 2024 A_Primer_on_the_Inner_Workings_of_Transformer-Bassed_Language_Models-2024-arxiv-2405.00208v2.pdf
- Oct 7 09:53:27 2024 Alicia_Curth-Alan_Jeffares-Michaela_van_der_Schaar_-_Why_do_Random_Forests_Work_-_arxiv-2402.01502v1.pdf
- Oct 4 18:12:09 2024 Alicia_Curth_-_Classical_Statistical_In-Sample_Intuitions_Dont_Generalize_Well_-_arxiv-2409.18842v1.pdf
- Oct 2 18:48:03 2024 Bennett-Welsh-Ciaunica_-_Why_Is_Anything_Conscious_-_arxiv-2409.14545v1-2024.pdf
- Sep 26 21:36:19 2024 Blass-Gurevich_-_Negative_probabilities_II_What_They_Are_and_What_They_Are_For-arxiv-1807.10382-2018.pdf
- Sep 26 21:07:26 2024 Abramsky-Brandenburger_-_An_Operational_Interpretation_of_Negative_Probabilities_and_No-Signalling_Models_-_arxiv-1401.2561v2.pdf
- Sep 23 08:15:09 2024 What_is_Entropy-John_Baez_-_arxiv-2409.09232v1.pdf
- Jun 18 19:58:20 2024 Software_in_the_natural_world_A_computational_approach_to_hierarchical_emergence-arxiv-2402.09090v2.pdf
- Apr 20 10:27:35 2024 Building_Cross-Sectional_Systematic_Strategies_By_Learning_to_Rank_-_arxiv-2012.07149.pdf
- Feb 23 20:40:42 2024 Experts_Dont_Cheat_Learning_What_You_Dont_Know_By_Predicting_Pairs-arxiv-2402.08733.pdf
- Nov 27 23:29:19 2023 Conformal_Prediction_for_Time_Series_with_Modern_Hopfield_Networks_-_arxiv-2303.12783.pdf
- Nov 25 08:49:08 2023 Portfolio_Construction_with_Gaussian_Mixture_Returns_-_arxiv-2205.04563.pdf
- Oct 23 18:16:57 2023 Good_Enough_Practices_in_Scientific_Computing_-_arxiv-1609.00037.pdf
- Aug 13 18:46:12 2023 A_Gentle_Introduction_to_Conformal_Prediction_and_Distribution-Free_Uncertainty_Quantification_-_arxiv-2107.07511.pdf
- Aug 4 16:54:17 2023 NNT_Taleb_-_Statistical_Consequences_of_Fat_Tails_-_arxiv-2001.10488.pdf.pdf
- May 15 15:33:46 2023 Forecasting-Theory_and_Practice-arxiv-2012.03854.pdf
- Dec 22 10:29:36 2022 Empirical_Macroeconomics_and_DSGE_Modeling_in_Statistical_Perspective_-_arxiv-2210.16224.pdf
- Sep 1 21:07:05 2022 Varley_-_Flickering_emergences_The_question_of_locality_in_information-theoretic_approaches_to_emergence_-_arxiv-2208.14502.pdf
- Aug 31 13:22:42 2022 Applying_compressed_sensing_to_genome-wide_association_studies_-_arxiv-1310.2264.pdf
- Mar 17 09:27:18 2022 Renzo_Comolatti-Erik_Hoel_-_Causal_emergence_is_widespread_across_measures_of_causation_-_arxiv-2202.01854.pdf
- May 12 08:40:46 2021 EigenGame_Unloaded_When_playing_games_is_better_than_optimizing_-_arxiv-2102.04152.pdf
- May 12 08:11:51 2021 DeepMind_-_EigenGame_PCA_as_a_Nash_Equilibrium_-_arxiv-2010.00554.pdf
- Apr 2 23:20:06 2021 Hidden_Markov_Models_Applied_To_Intraday_Momentum_Trading_With_Side_Information_-_arXiv.2006.08307.pdf
- Oct 16 19:56:13 2020 NNT_Taleb_-_Election_Predictions_as_Martingales_An_Arbitrage_Approach_-_arxiv-1703.06351.pdf
- Jul 27 12:34:37 2020 Ole_Peters_-_Alex_Adamou_-_Leverage_efficiency_-_arXiv-1101.4548.pdf
- Jun 25 11:35:55 2019 Kakushadze_Yu_-_Statistical_Risk_Models_-_arxiv_1602.08070.pdf
- Jun 14 10:00:57 2019 Ergodicity-breaking_reveals_time_optimal_economic_behavior_in_humans_-_arxiv-1906.04652.pdf
- Jun 13 12:09:46 2019 Kakushadze_-_Altcoin-Bitcoin_Arbitrage_-_arxiv_-_1903.06033.pdf
- Mar 27 20:15:47 2019 Neural-Ordinary-Differential-Equations-arxiv-1806.07366.pdf
- Mar 27 19:56:35 2019 Polynomial-Regression-Alternative-to-Neural-Nets-arxiv-1806.06850.pdf
- Nov 19 21:17:09 2012 Forecasting-with-time-varying-vector-autoregressive-models-arxiv-0802.0220v2.pdf