Blogs


I write blogs from time to time to introduce some of my new research topics or reflect on past research. (Opinions expressed are my own.)


decentralized_arena

Decentralized Arena via Collective LLM Intelligence


Building Automated, Robust, and Transparent LLM Evaluation for Numerous Dimensions
Oct, 2024

This blog introduces Decentralized Arena (DeArena), the first scalable, automated LLM evaluation system, expanding and refining the "Chatbot Arena" concept across a wide range of dimensions. By enabling LLMs to evaluate each other, DeArena fosters a transparent, autonomous, and reproducible approach to AI benchmarking, minimizing bias and computational demands through an efficient sorting-based algorithm that outperforms traditional methods. Its potential reaches into the oversight of future superintelligence, offering a democratic, collective intelligence-driven alternative when human judgment becomes insufficient or unreliable.


txt360_from_llm360

TxT360: A Top-Quality LLM Pre-training Dataset Requires the Perfect Blend


Oct, 2024

This blog introduces TxT360 (Trillion eXtracted Text), a large-scale pre-training dataset from LLM360. It is the first dataset to globally deduplicate 99 CommonCrawl snapshots and 14 high-quality data sources from diverse domains (e.g., FreeLaw, PG-19, etc.). We released this blog to explain every detail of how TxT360 was produced. Participating in this great project and leading several initiatives gave me better insights into how data scaling works in balancing noisy web and highly curated data.


chatgpt-year-1

Reflecting on ChatGPT’s First Year: Evolutions, Twists, and Smooth Directions


Jan, 2024

About one year after ChatGPT's release, this blog reflects on its impact on LLM research, examining which new topics have emerged and which older ones have become less relevant. What surprised me most is how rapidly the field has evolved in just one year. I highlighted several "twisted" directions, meaning areas where there has been an unexpected surge or decline in academic interest. In contrast, some "smooth" directions have consistently received significant research attention. By better understanding these shifts in research focus, we can more accurately measure the influence of ChatGPT-related techniques and more effectively prioritize the impactful areas we want to explore next.

Stay tuned for the blog, Reflecting on ChatGPT's Second Year at the end of 2024!