In the contemporary world of learning algorithms, “data is the new oil”, and like crude oil, it must be refined efficiently before it yields valuable information. For state-of-the-art machine learning algorithms to work their magic, raw data needs to be infused with domain knowledge and distilled into “features”. This talk introduces the audience to feature engineering, the most creative aspect of data science and one that rarely gets the limelight it deserves. It walks through the feature engineering process as practiced in formal settings, using a simple hands-on Python example on publicly available data, and presents popular techniques such as hashing, encoding, and embedding, which help get the most out of the data once it has been given a proper structure for predictive modeling. Related terms such as feature relevance, selection, combination, and explosion will also be discussed. The goal is to establish the importance of data, especially in a well-prepared format, and the effect it has on building smart learning algorithms.
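As a rough illustration of two of the techniques named above, the sketch below one-hot encodes a categorical column and applies the hashing trick to it. It is not the example used in the talk: the city names are made-up toy data, and the helper names `one_hot` and `hash_feature`, as well as the bucket count of 8, are assumptions chosen for illustration.

```python
import hashlib


def one_hot(value, vocabulary):
    """Encode a category as a 0/1 vector with one slot per known category."""
    return [1 if value == known else 0 for known in vocabulary]


def hash_feature(value, n_buckets=8):
    """Hashing trick: map a category to a fixed-size bucket index.

    md5 is used only because it is deterministic across runs, unlike
    Python's built-in hash() for strings. Different categories may collide.
    """
    digest = hashlib.md5(value.encode("utf-8")).hexdigest()
    return int(digest, 16) % n_buckets


# Hypothetical toy column, not taken from the talk's dataset.
cities = ["Stockholm", "Gothenburg", "Stockholm", "Malmo"]

# Encoding: the vocabulary must be known up front and grows with the data.
vocabulary = sorted(set(cities))
encoded = [one_hot(city, vocabulary) for city in cities]

# Hashing: no vocabulary is needed and the output width is fixed.
hashed = [hash_feature(city) for city in cities]

print(vocabulary)
print(encoded)
print(hashed)
```

The trade-off shown here is the usual one: one-hot encoding keeps categories distinct but widens with every new value, while hashing keeps the width fixed at the cost of possible collisions.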