I just read this very interesting article -- it talks about the life of human (especially women) data annotators based in small towns and villages in India. They are the lifeline behind the most unglamorous part of the AI pipeline: data annotation, or labeling. I also learnt that India is one of the worldโ€™s largest markets for data annotation labor!

Read More

Volumes of crude data are available at our fingertips today, and the latest concept of a #DataLake helps store any type or volume of data as-is, process it in real-time or batch mode, and analyze it at scale ๐Ÿคฝ๐Ÿงต๐Ÿ‘‡

Read More

Machine Learning models are as good as the data they consume๐ŸดData impacts performance, fairness, robustness & scalability of #ML Systems. If not taken care of, it leads to a TON of tech debt over time in a corporate setting, downstream effects of which are termed as DATA CASCADES ๐ŸŒŠ ๐Ÿงต๐Ÿ‘‡๐Ÿป

Read More

I have been professionally working as a Machine Learning Engineer since more than 2 years now and also, recently co-authored a book titled โ€œSculpting Data for ML: The first act of Machine Learningโ€. My past few experience have taught me that data does not get its due limelight in #MachineLearning as compared to complex model architecture. Keeping up with 'more data beats clever algorithms, but better data beats more data', here are top 5 tips for polishing the dataset to effectively solve #ML problems ๐Ÿค–๐Ÿ‘‡๐Ÿป

Read More
Hits