Blog

  • Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

    Yanyan Zhang

    Although batch inference offers numerous benefits, it’s limited to 10 batch inference jobs submitted per model per Region. To address this consideration and enhance your use of batch inference, we’ve developed a scalable solution using AWS Lambda and Amazon DynamoDB. This post guides you through implementing a queue management system that automatically monitors available job slots and submits new jobs as slots become available.
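    As a rough illustration of the pattern described above (not the post's exact implementation), here is a minimal Python (boto3) sketch of such a queue-managing Lambda function. The DynamoDB table name and item attributes are hypothetical, and a production version would also count jobs in the Submitted state, handle pagination, and add retries.

    ```python
    # Submit queued Bedrock batch jobs while slots remain under the per-model,
    # per-Region quota, reading pending job specs from a DynamoDB queue table.
    import boto3
    from boto3.dynamodb.conditions import Attr

    MAX_CONCURRENT_JOBS = 10  # the per-model, per-Region limit cited in the post

    bedrock = boto3.client("bedrock")
    table = boto3.resource("dynamodb").Table("bedrock-job-queue")  # hypothetical

    def lambda_handler(event, context):
        # Count batch jobs still in flight to work out how many slots are free.
        running = bedrock.list_model_invocation_jobs(statusEquals="InProgress")
        open_slots = MAX_CONCURRENT_JOBS - len(running.get("invocationJobSummaries", []))
        if open_slots <= 0:
            return {"submitted": 0}

        # Fetch queued job specs and submit one per free slot.
        queued = table.scan(FilterExpression=Attr("status").eq("QUEUED"))
        submitted = 0
        for item in queued["Items"][:open_slots]:
            bedrock.create_model_invocation_job(
                jobName=item["jobName"],
                roleArn=item["roleArn"],
                modelId=item["modelId"],
                inputDataConfig={"s3InputDataConfig": {"s3Uri": item["inputS3Uri"]}},
                outputDataConfig={"s3OutputDataConfig": {"s3Uri": item["outputS3Uri"]}},
            )
            table.update_item(  # mark the queue entry as submitted
                Key={"jobName": item["jobName"]},
                UpdateExpression="SET #s = :v",
                ExpressionAttributeNames={"#s": "status"},
                ExpressionAttributeValues={":v": "SUBMITTED"},
            )
            submitted += 1
        return {"submitted": submitted}
    ```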

    Originally appeared here:
    Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline

  • Three Crucial Data Lessons That I Learned from a Data Conference That’s Not Related to AI

    Nithhyaa Ramamoorthy

    Underrated concepts that help foster analytics excellence in organizations

    Image generated by Author using AI prompts in Microsoft Co-pilot

    Conference attendance has been a regular part of my life as a data professional since my early career days. The field of data science is vast and diverse. While that means there is a huge variety of data roles and practitioners out there, it also means that no matter how small your niche or how specific and esoteric your problem is, someone else at another company is wrestling with the same thing. The proof is in the endless questions and memes in Stack Overflow and Kaggle threads and other knowledge-base forums. Early in my career, connecting with the data community helped me learn novel techniques, apply new knowledge to some of the old problems I was solving, and become a more efficient analyst.

    “Many ideas grow better when transplanted into another mind than the one where they sprang up.” — Oliver Wendell Holmes.

    More recently, I started attending data community gatherings and conferences as a speaker, and having a seat at the table on expert panels and in the speaker lounges has been a game changer. It has helped me immensely to think creatively about my job and my role in data stewardship, and to become a better mentor and steward for the folks who rely on my expertise. I recently attended Data Connect 2024 as a speaker. While most conferences over the past couple of years have been heavily focused on AI, I was fortunate to learn about the following three critical aspects of data analytics and management that I can readily apply in my day-to-day responsibilities. In this article, I'll share my interpretation of and learnings from these sessions, action items, and my reflections on these crucial data topics as a data practitioner of over 12 years.

    1. Cost containment

    The concept of data ROI gets talked about a lot, but it rarely gets quantified and tracked as an official metric that is shared out consistently. Cost containment has been on my mind both as an individual contributor and as a team lead, ever since my days as an analytics intern. Who can forget the first time they let an un-optimized SQL query with full outer joins run for several hours before getting a warning call from their org's DBA? (Not me!) Ever since then, cost containment has been one of those concepts living rent-free in my brain. Many data solution providers have switched from a tier-based pricing model to a consumption-based one, which makes cost optimization an essential tool in the data management and leadership toolkit.

    Data teams may face unexpected bills when query optimization is overlooked or when inefficient data practices are in place. For instance, apart from running extensive queries, even failing to archive unused data can lead to substantial increases in storage costs. To manage such unexpected expenses, it is crucial to implement effective cost management strategies: monitoring usage patterns, optimizing queries, and setting up alerts for unusual activity. By understanding these hidden costs, teams can better control their data tools' pricing and ensure a sustainable data ROI. I also learned about the importance of investing in training data teams to optimize their use of analytics tools. Educating staff on cost-efficient query writing and data handling, so that resources are used to their full potential, is often overlooked, but it is one of the low-hanging fruits that can be effective in cutting costs.

    Another pitfall occurs when data teams fail to accurately forecast their data usage. This oversight can result in substantial unanticipated expenses, particularly when scaling operations rapidly. To avoid these pitfalls, organizations should maintain open communication with vendors, closely monitor data usage, and regularly review contract terms. By anticipating changes and preparing for potential cost fluctuations, businesses can better manage their data tools' pricing and keep their data ROI positive.

    Actionable takeaways:

    • Managing Expenses: To monitor expenses effectively, data teams should implement comprehensive dashboards that provide real-time insight into spending patterns. These tools should highlight daily, weekly, and monthly cost trends so teams can spot irregularities quickly. Setting up automated alerts for unusual usage or spending spikes further safeguards against budget overruns (a minimal sketch of such an alert follows this list).
    • Anticipating Fluctuations: Maintain open communication with vendors, closely monitor data usage, and regularly review contract terms. Anticipating changes and preparing for potential cost fluctuations keeps expense forecasts accurate.
    • Data and Process Audits: Conduct frequent audits of all data-related expenses to spot savings opportunities without sacrificing data quality.
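
    As referenced in the first takeaway, here is a tiny, illustrative Python sketch of a spending-spike alert. The data source (a plain list of daily costs) and the 1.5x threshold are assumptions for demonstration, not a prescribed setup.

    ```python
    # Flag a spending spike: compare the latest day's cost with the trailing
    # average and alert when it exceeds a chosen multiple of that baseline.
    from statistics import mean

    def spend_is_anomalous(daily_spend, spike_factor=1.5):
        """daily_spend: daily costs, oldest first; the last entry is today."""
        *history, today = daily_spend
        return today > spike_factor * mean(history)

    # Example: a quiet week followed by a jump trips the alert.
    week = [120.0, 115.0, 130.0, 118.0, 122.0, 125.0, 310.0]
    if spend_is_anomalous(week):
        print("Spending spike detected: review recent queries and storage growth.")
    ```

    In practice the same check can run on cost data pulled from a billing export, with the threshold tuned to your workload's normal variance.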

    2. Data translation and other proven ways to demonstrate the value of data teams

    While cost containment is one important piece that influences a data team's ROI, the other side of the coin is measuring the worth and effectiveness of data analytics efforts, and eventually of the data teams themselves. As I was wrapping up my notes from a talk on data analytics team efficiencies, one of the moderators sparked an interesting discussion about "Data Translation" and the need for dedicated organizational efforts to bridge the literacy gaps between data and business teams.

    Bridging the Data and Business Literacy Gaps: Image generated by Author using AI prompts in Microsoft Co-pilot

    As data infrastructure and data team hiring costs have been consistently increasing, it’s crucial for businesses to see a return on this investment. High costs can be justified only if the data team’s work translates into actionable insights that drive business growth, innovation, and efficiency. Without clear value, these expenses can seem burdensome. Data leaders need to ensure that their teams are aligned with business objectives and are working on projects that offer substantial returns. By effectively managing these costs, organizations can maintain a competitive edge and leverage data as a strategic asset. This requires thoughtful allocation of resources, prioritization of impactful projects, and fostering a culture of data literacy to maximize the utility and influence of the data team across the organization.

    Actionable takeaways:

    • Data Translation: Empower your data team to become great storytellers. Document the key outcomes and process efficiencies that result from data analyses, and share success stories across the organization. Many organizations have a dedicated “Data Translator” role in charge of these responsibilities.
    • Know Your Worth: Keep tabs on team costs, be aware of salary trends, and back up your investments with analytics that inspire action.
    • Engage Like a Pro: Use tools like a Stakeholder Engagement Matrix. Identify key players and build solid relationships. Your goal? Get everyone on the same page with the company’s strategic goals.
    • Map Out the Path: Craft a strategic plan by dreaming big, getting creative, and picking projects that make a significant impact. Remember the 80–20 rule — balance daily upkeep with space for innovation.
    • Build Bridges: Boost data literacy across the organization. Offer targeted training, close the knowledge gaps, and empower everyone to use data confidently.
    • Think beyond the Silos: Promote inter-departmental collaboration to sync everyone’s priorities. It creates a well-oiled machine where each team’s efforts resonate with the business’s big picture.

    3. New Tactics in Information Design and Data Storytelling

    Picture of Author presenting at Data Connect 2024

    As I went through the speaker coaching boot camp, I couldn’t help but draw parallels between public speaking tactics and the data storytelling tactics we often learn in our jobs. The process of brainstorming and picking a well-rounded topic, being okay with imperfections, and finally mapping it all into an overall governing idea made me reflect on how data storytelling is much more than pretty charts and documented patterns. I also got to listen to a great presentation on effective visual communication tips for data presentations. I learned that by integrating narrative with data visualization, you engage both the logical and emotional sides of your audience’s brain, making your insights more memorable. This approach ensures that data isn’t just seen as numbers on a page but as a critical driver of strategic decisions. I also learned that aligning your data story with the audience’s needs and data literacy is the easiest way to encourage them to take meaningful action based on the data.

    Actionable takeaways:

    • Clearly define the purpose of the Data story: Let the audience know upfront about the learning objectives and identify the main takeaway. This core message acts as the foundation of your narrative, guiding every decision you make in terms of data visualization and storytelling techniques.
    • The governing idea: Find the core message you’re delivering through your data analysis and fit your data story into a compelling arc. The outline I used for my talk was: identify the problem, make the problem relatable to the audience, present the solution with supporting facts and charts for why it works, and finally close with an inspiring end and a call to action.
    • Supporting Elements: Use visualizations, metrics, and annotations to support your data story. Every data point in your story arc should strengthen and reinforce the core message. It is also important to ensure the accuracy and relevance of the metrics you use; these should align with your core message and provide insight into the story you’re telling. Use color strategically to highlight key data points and maintain audience focus; consistency and contrast are the key elements of effective color usage (a small sketch follows this list).
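
    To make the color advice concrete, here is a small matplotlib sketch with made-up data: every bar except the one carrying the core message is muted, so contrast does the emphasis and the title states the takeaway.

    ```python
    # Mute all bars except the one that carries the story; let contrast and
    # the chart title do the emphasis. Data and labels are invented here.
    import matplotlib.pyplot as plt

    regions = ["North", "South", "East", "West"]
    growth = [4.1, 3.8, 9.6, 4.4]  # "East" is the point the audience should see
    colors = ["#d62728" if r == "East" else "#c8c8c8" for r in regions]

    fig, ax = plt.subplots()
    ax.bar(regions, growth, color=colors)
    ax.set_ylabel("YoY growth (%)")
    ax.set_title("East region drives this quarter's growth")  # takeaway as title
    plt.show()
    ```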

    Conclusion

    Attending data conferences is a great way to keep up with current trends and learn new concepts. While AI-adjacent topics have dominated most conference agendas over the past couple of years, for very valid reasons, I was deeply grateful to learn about these three crucial data management topics. They helped me solidify my foundational knowledge, sharpen my data communication skills, and tap into the collective hive mind of experienced data subject matter experts to solve problems that are as ubiquitous as AI.

    Note: A big thank you to Data Leaders Kathy Koontz, Lindsey Cohen, Akia Obas, Lyndsey Pereira-Brereton, and many more bright minds for having these thought-provoking discussions with me and inspiring this post.

    About the Author:

    Nithhyaa Ramamoorthy is a data subject matter expert with over 12 years of experience in analytics and big data, specifically at the intersection of healthcare and consumer behavior. She holds a Master’s degree in Information Sciences and, more recently, a CSPO, along with several other professional certifications. She is passionate about leveraging her analytics skills to drive business decisions that create inclusive and equitable digital products rooted in empathy.


    Originally appeared here:
    Three Crucial Data Lessons That I Learned from a Data Conference That’s Not Related to AI

  • Build a video insights and summarization engine using generative AI with Amazon Bedrock

    Simone Zucchet

    This post presents a solution where you can upload a recording of your meeting (a feature available in most modern digital communication services such as Amazon Chime) to a centralized video insights and summarization engine. This engine uses artificial intelligence (AI) and machine learning (ML) services and generative AI on AWS to extract transcripts, produce a summary, and provide a sentiment for the call. The solution notes the logged actions per individual and provides suggested actions for the uploader. All of this data is centralized and can be used to improve metrics in scenarios such as sales or call centers.
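
    As a rough sketch of the pipeline shape described here (not the post's exact implementation), the following Python (boto3) snippet transcribes an uploaded recording with Amazon Transcribe and then summarizes the transcript with a Bedrock model via the Converse API. The bucket, job name, prompt, and model choice are placeholders.

    ```python
    import boto3

    transcribe = boto3.client("transcribe")
    bedrock = boto3.client("bedrock-runtime")

    # 1. Kick off transcription of the recording already uploaded to S3.
    transcribe.start_transcription_job(
        TranscriptionJobName="meeting-2024-11-01",             # placeholder
        Media={"MediaFileUri": "s3://my-bucket/meeting.mp4"},  # placeholder
        MediaFormat="mp4",
        LanguageCode="en-US",
        OutputBucketName="my-bucket",
    )

    # 2. Once the transcript is ready (polling omitted for brevity), summarize
    #    it and extract per-speaker actions with a Bedrock model.
    def summarize(transcript_text):
        response = bedrock.converse(
            modelId="anthropic.claude-3-haiku-20240307-v1:0",  # example model
            messages=[{
                "role": "user",
                "content": [{"text": "Summarize this meeting, note the overall "
                                     "sentiment, and list action items per "
                                     "speaker:\n" + transcript_text}],
            }],
        )
        return response["output"]["message"]["content"][0]["text"]
    ```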

    Originally appeared here:
    Build a video insights and summarization engine using generative AI with Amazon Bedrock

  • Automate document processing with Amazon Bedrock Prompt Flows (preview)

    Erik Cordsen

    This post demonstrates how to build an IDP pipeline for automatically extracting and processing data from documents using Amazon Bedrock Prompt Flows, a fully managed service that enables you to build generative AI workflows using Amazon Bedrock and other services in an intuitive visual builder. Amazon Bedrock Prompt Flows allows you to quickly update your pipelines as your business changes, scaling your document processing workflows to help meet evolving demands.
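
    For readers curious what driving such a flow from code might look like once it has been built and published in the visual builder, here is a minimal boto3 sketch using the InvokeFlow API. The flow ID, alias, and input value are placeholders; the input node and output names below are the builder's defaults.

    ```python
    import boto3

    client = boto3.client("bedrock-agent-runtime")

    response = client.invoke_flow(
        flowIdentifier="FLOW_ID",          # placeholder
        flowAliasIdentifier="FLOW_ALIAS",  # placeholder
        inputs=[{
            "nodeName": "FlowInputNode",   # default input node name
            "nodeOutputName": "document",
            "content": {"document": "Text extracted from the uploaded document"},
        }],
    )

    # The call returns an event stream; print the flow's output events.
    for event in response["responseStream"]:
        if "flowOutputEvent" in event:
            print(event["flowOutputEvent"]["content"]["document"])
    ```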

    Originally appeared here:
    Automate document processing with Amazon Bedrock Prompt Flows (preview)

  • Governing the ML lifecycle at scale: Centralized observability with Amazon SageMaker and Amazon CloudWatch

    Abhishek Doppalapudi

    This post is part of an ongoing series on governing the machine learning (ML) lifecycle at scale. To start from the beginning, refer to Governing the ML lifecycle at scale, Part 1: A framework for architecting ML workloads using Amazon SageMaker. A multi-account strategy is essential not only for improving governance but also for enhancing […]
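
    As one illustrative fragment of that kind of centralization (an assumption about the setup, not the series' exact architecture), the following Python (boto3) snippet publishes a custom ML-workload metric into a shared CloudWatch namespace that a central monitoring account could then aggregate and chart. The namespace and dimensions are hypothetical.

    ```python
    import boto3

    cloudwatch = boto3.client("cloudwatch")

    # Emit a custom metric from a workload account; CloudWatch cross-account
    # observability then lets a central account query and dashboard it.
    cloudwatch.put_metric_data(
        Namespace="MLGovernance/Training",  # hypothetical shared namespace
        MetricData=[{
            "MetricName": "TrainingJobFailures",
            "Dimensions": [
                {"Name": "Team", "Value": "fraud-detection"},    # hypothetical
                {"Name": "AccountId", "Value": "123456789012"},  # hypothetical
            ],
            "Value": 1.0,
            "Unit": "Count",
        }],
    )
    ```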

    Originally appeared here:
    Governing the ML lifecycle at scale: Centralized observability with Amazon SageMaker and Amazon CloudWatch

  • Apple stuck the Mac mini power button on the bottom

    Don’t expect to fit the new M4 Mac mini into too tight a space because you’re going to have to tip it to reach underneath every time you need to switch it on.

    The power button is underneath the new Mac mini — image credit: Apple

    If the position of the charging port on the Magic Mouse isn’t the questionable design choice it’s been called, the new Mac mini’s power button might be. As pointedly not shown by Apple in its launch video or new ad, the Mac mini’s power button is underneath the device.

    As can be seen on the online store’s page for the new M4 Mac mini, the button is not on the very base of the model. It is, though, underneath it, raised off the ground only by the cooling vent.

    Originally appeared here:
    Apple stuck the Mac mini power button on the bottom