Open LLMs: A Tale Of Two Licenses

Databricks Open Model License Llama 2 License   Differences in Restrictions Modification: The Databricks license explicitly requires stating that modifications have been made, while the Llama 2 license does not. Updates: The Databricks license requires making reasonable efforts to use the latest version of the model, which the Llama 2 license does not mention. IfContinue reading “Open LLMs: A Tale Of Two Licenses”

Apple’s ReALM: Making Virtual Assistants More Intuitive and Helpful in Everyday Life

Apple’s ReALM presents a groundbreaking approach to reference resolution, harnessing the power of large language models (LLMs) to revolutionize how conversational AI systems interpret user queries. By expanding the scope beyond traditional textual references to include on-screen and background entities, ReALM grants virtual assistants the ability to “see” and comprehend the visual world, leading toContinue reading “Apple’s ReALM: Making Virtual Assistants More Intuitive and Helpful in Everyday Life”

Taming the Unstructured Beast: Data Tools for Unleashing Generative AI

Subscribe • Previous Issues Is Your Data Strategy Ready for Generative AI? The deeper I explore Generative AI (GenAI) and Large Language Models (LLMs), the more I confront the complexities of integrating custom data to improve their performance. Whether it’s through fine-tuning or building retrieval augmented generation (RAG) systems, the success of these applications hinges on ourContinue reading “Taming the Unstructured Beast: Data Tools for Unleashing Generative AI”

Is Your Data Strategy Ready for Generative AI?

The deeper I explore Generative AI (GenAI) and Large Language Models (LLMs), the more I confront the complexities of integrating custom data to improve their performance. Whether it’s through fine-tuning or building retrieval augmented generation (RAG) systems, the success of these applications hinges on our ability to harness the power of unstructured data locked awayContinue reading “Is Your Data Strategy Ready for Generative AI?”

Jamba: The LLM with Mamba Mentality

AI21 Labs has introduced Jamba, the world’s first production-grade language model built on a hybrid architecture that combines Mamba Structured State Space (SSM) technology with elements of the traditional Transformer architecture. This innovative approach addresses the limitations of pure Transformer or SSM models, offering significant improvements in memory footprint, throughput, and the efficient handling ofContinue reading “Jamba: The LLM with Mamba Mentality”

The Efficient Frontier of LLMs: Better, Faster, Cheaper

Since the release of ChatGPT in November 2022, there has been an explosion of interest in large language models (LLMs), with numerous open-source and proprietary models entering the market. As competition intensifies, LLM providers are increasingly focusing on efficiency as a key differentiator to attract users and stay ahead of the curve. In the raceContinue reading “The Efficient Frontier of LLMs: Better, Faster, Cheaper”

Apple vs. DOJ: Weighing the Arguments in the Lawsuit Against Apple

As a seasoned observer of the tech industry, the recent lawsuit filed by the U.S. Department of Justice (DOJ) against Apple, which accuses the company of wielding an iPhone monopoly, presents an important examination of competition and innovation within the smartphone sector. The DOJ’s complaint leverages striking statistics, highlighting Apple’s commanding 70% market share inContinue reading “Apple vs. DOJ: Weighing the Arguments in the Lawsuit Against Apple”

Boosting RAG Systems with Knowledge Graphs: Early Insights

In a previous post, I explored the potential of knowledge graphs (KGs) for enhancing language models (LLMs). Building on that, I have collected results from early studies on the use of KGs in retrieval augmented generation (RAG) systems. We are beginning to see the integration of KGs and RAG, particularly in the case of structuredContinue reading “Boosting RAG Systems with Knowledge Graphs: Early Insights”

Exploring the Efficient Frontier of LLMs

Subscribe • Previous Issues The Efficient Frontier of LLMs: Better, Faster, Cheaper Since the release of ChatGPT in November 2022, there has been an explosion of interest in large language models (LLMs), with numerous open-source and proprietary models entering the market. As competition intensifies, LLM providers are increasingly focusing on efficiency as a key differentiator to attractContinue reading “Exploring the Efficient Frontier of LLMs”