Once upon a time, in the land of crypto, eager developers toiled away on their creations, crafting intricate systems and complex mechanisms. They were filled with hope that users from all corners of the world and the World Wide Web would soon flock to partake. However, many a promising technological wonder found itself cold and […]
Trinique’s TNQ token has hit a major milestone, surpassing $1 million in trading volume since its recent listing on Coinstore. This accomplishment shows that investors are becoming more engaged and interested in TNQ’s product. The token is now more widely accessible and is becoming more well-known in the cryptocurrency community thanks to its […]
As large language model-based workflows become both more sophisticated and more widespread, we’re seeing a growing number of novel approaches that help practitioners tailor (and improve) the models’ performance to specific projects and use cases. Many of our best-read articles in the past month zoomed in on this trend, with excellent guides for both novices and experienced users.
Our monthly highlights go beyond the exciting world of LLMs to explore other topics that remain top of mind for many data and ML professionals—from solidifying their math skills to streamlining error messages in Python. We hope you carve out some time over the next few days to discover (or revisit) some of our most popular articles from March. Let’s dive in!
Monthly Highlights
Intro to DSPy: Goodbye Prompting, Hello Programming!
Few recent tools have generated as much excitement as DSPy, a powerful open-source framework for algorithmically optimizing prompts and weights. Leonie Monigatti brought her signature clarity and practical approach to this topic, and her beginner-friendly guide attracted the largest readership on TDS this month.
How to Learn the Math Needed for Data Science
How much math knowledge should data scientists accumulate in order to do well in their job? The yearslong debate rages on, but for anyone who’s still in the process of building their fundamental skills, Egor Howell’s primer, which comes with ample resources and tips, is a great place to start.
Why LLMs Are Not Good for Coding
AI-assisted programming is not exactly new, but talk about the imminent disappearance of developers has become a lot more common in the past year or so. Depending on your perspective, Andrea Valenzuela’s assessment of LLMs’ current limitations will be either sobering or comforting; testing ChatGPT’s abilities, she concludes that “it often struggles to generate efficient and high-quality code.”
Visualize your RAG Data — Evaluate your Retrieval-Augmented Generation System with Ragas
Evaluating the performance of retrieval-augmented generation (RAG) systems is essential, but often tricky. In his TDS debut, Markus Stoll walks readers through the basics of working with Ragas, a framework that facilitates RAG pipeline evaluations, and pays particular attention to visualizing the results effectively.
Building Your First Desktop Application using PySide6 [A Data Scientist Edition]
For anyone in the mood for tinkering, but less passionate about LLMs, why not try a different type of project? Arunn Thevapalan presented a step-by-step guide to building a functional desktop app with PySide6, a skill that can prove useful for data professionals in a wide range of contexts, especially when sharing your work with other stakeholders is crucial.
How to Generate Instruction Datasets from Any Documents for LLM Fine-Tuning
We’re not quite done with LLMs just yet! Collecting data for fine-tuning these models can be time-consuming and costly; as a potential workaround, Yanli Liu proposes an innovative approach: automating the creation of instruction datasets from various documents with the aid of Bonito, an open-source library.
What I Learned in My First 3 Months as a Freelance Data Scientist
“It really comes down to this: I get to pick what I work on, when I work on it, and for whom I am working.” After a long data science career at a wide range of companies, CJ Sullivan decided to switch tracks and become a freelancer; her latest article offers insightful reflections and pragmatic pointers for anyone else who might be considering a similar transition.
Say Goodbye to Confusing Python Error Messages
Spending less time debugging your code is a perennial goal for developers and data scientists alike. One element that can make a real difference on that front is working with clearer and more actionable error messages, something you can achieve by exploring Christopher Tao’s detailed guide on the open-source PrettyErrors library.
“Do you believe in the multiverse?” our petite and cheerful guide Angelina asked me when I told her what I do for a living, while navigating the rambunctious streets of Hanoi’s Old Quarter. It was not a conversation I was expecting to have on a vegan street food tour in Vietnam, but as Angelina studies AI and VR (and as we are both avid Marvel fans), our chat took a turn down a dimensional rabbit hole. I wonder what kind of questions she would have for the founders of Multiverse Computing, a Spanish deeptech scaleup. It offers what it…
The Omni S1 Pro is eufy’s new robot vacuum and mop combo that introduces a plethora of intelligent features. It’s available to order on Kickstarter now!