Replies: 3 comments
-
Hello @zeydabadi and everyone! My name is Alice and I am a Computer Science undergrad at Stanford University pursuing the Computational Biology track. I am very interested in joining this project as I am passionate about contributing to the intersection between AI and health.
I have a strong foundation in Python and HTML/CSS, and also have experience in C, PHP, and JavaScript. I tried working on the Kaggle Competition for EEG recognition. However, as my studies took too much of my attention, I have not been able to fully dive deep into this matter. It would be so interesting and fulfilling to get that opportunity this summer! I look forward to discussing this project further with you! Warm regards, |
Beta Was this translation helpful? Give feedback.
-
Hey @zeydabadi and everyone else, This is Aryan, a pre-final year undergraduate student majoring in Computer Science at BITS Pilani.Your project on developing an open-source EEG Foundation Model has sparked my interest immensely. The prospect of contributing to such a noble endeavor resonates strongly with both my academic pursuits and personal aspirations. Coming to my experience, I have worked on supervised methods of ML like Semi-Supervised and Self-Supervised. Temporal data(Financial Data in my case), LLMs, Retrieval Augmented Generation, building small-scale LLMs and playing around with the embeddings ;) My journey in the field of machine learning and deep learning has been both enriching and inspiring. I am profoundly fascinated with the potential of Large Language Models (LLMs) and their applications across various domains. My experience spans working with frameworks like PyTorch, TensorFlow, and Keras, where I've worked on advanced algorithms to tackle diverse challenges. My research revolves around harnessing the power of LLMs, with a focus on improving model interpretability and explainability—a crucial aspect in ensuring the reliability and trustworthiness of AI systems. I have a solid understanding of the complexities of handling temporal data, which is crucial considering EEG data's inherently temporal nature. My previous work with semi-supervised learning and working with Medical Data(with U-NET) has equipped me with valuable insights that I believe will be beneficial in developing the EEG Foundation Model. I am eager to bring my skills and knowledge to this project and collaborate with like-minded individuals to contribute to EEG research. Thank you for considering my interest, and I look forward to participating in this innovative endeavor. I tried to delve into some existing Papers on Foundational models for EEG, Here’s what I found so far: NEURO-GPT: DEVELOPING A FOUNDATION MODEL FOR EEG: This paper talks about Neuro-GPT, a foundation model that merges an EEG encoder with a GPT model. Pre-trained on extensive EEG datasets, it learns to reconstruct masked segments, capturing complex EEG patterns. The EEG encoder extracts features, while GPT predicts the next masked chunk. LARGE BRAIN MODEL FOR LEARNING GENERIC REPRESENTATIONS WITH TREMENDOUS EEG DATA IN BCI: The study introduces the Large Brain Model (LaBraM) as a comprehensive framework for EEG analysis. LaBraM facilitates cross-dataset learning by dividing EEG signals into EEG channel patches and utilizes vector-quantized neural spectrum prediction to encode these patches into concise neural representations, drawing inspiration from image patch embeddings. To enhance the model's ability to learn generic representations from extensive EEG data, the authors propose a masked EEG modeling approach, employing a symmetric masking strategy to enhance training efficiency. Subsequently, the efficacy of LaBraM is assessed through evaluations on various downstream tasks such as TUAB and TUEV. Experimental results demonstrate that LaBraM surpasses SOTA methods in their respective domains. Data scaling remains one of the problems mentioned in the paper. These are something I found upon research and I am open to suggestions and interaction on these. Thank you! |
Beta Was this translation helpful? Give feedback.
-
Development of an Open-Source EEG Foundation ModelAbstract The aim of the project is to develop an open-source foundation model which will be able to help the analysis of Electroencephalography (EEG) data. The main activities will be on investigating and adopting several algorithms from data extractions, patterns recognition and deep learning on a public dataset of EEG data. Contributor Carmine Sacco Potential Mentors
My Background Last but not least I attended some Coursera Courses from Andrew NG about the strategies to develop ML models in particular when your dataset is small, such as transfer learning which is based on the idea of fine tuning a model working for the same scope (pattern recognition from images) but with a larger dataset. In conclusion, I gained expertise from several sectors and environments (large companies, startups and universities) but I would like to join into Google Summer of Code program because it's huge opportunity to gain hands-on experience on real-world problems and start to switch my careers. Project Goals
Major Contributions
Project Schedule Since the project is 350 hours and you consider 35 hours per week, I suppose that totally there are 10 weeks which I'd like to develop in this way 1st - 2nd Weeks: Collect the public EEG datasets and all papers about the state of art of ML models for detection of diseases from medical images and/or EEG detection 3rd - 4th Weeks: Data anylisys of EEG (clean, extract and transform) + implement into a notebook the most promising models 5th - 6th Week: Make tests on EEG dataset and make a report for the different results you got on several models. Start to think how you can improve the model's precision, training. 7th - 8th Week: Implement your ideas and make the reports about the results. 9th - 10th Week: Clean Code and write a Paper. Planned GSoC work hours Full-time Project so at least 35 hours per week. If I can work full-remote from Italy, I would like to work from 10 am to 2pm CEST and from 3pm to 7pm CEST but I am flexible to adapt half day to your timezone. Vacation days Maybe I can take one week in the middle of August but it will depend by my progresses and achievements during these phases Skills set Conclusion Best regards, Carmine Sacco |
Beta Was this translation helpful? Give feedback.
-
Hello @zeydabadi !
I am Mohan a student of IIT BHU. I am interested in contributing to this project. This project can help doctors for faster EEG analysis which can help to avoid many psychological disorders.
I have experience with python and frameworks like Pytorch and TensorFlow. I have done internship related to large language models specifically For AI security for integrating security to stop prompt injection in the system. I have experience with fine tuning and have done some fine tuning of stable diffusion for producing icons Link. I have experience with self supervised learning methods like SimCLR , BYOL etc. I have some experience with signal processing which I think will be very important for Preprocessing of EEG data.
I have done some research regarding the project. I have read some research some papers regarding it and found some research papers very useful related to the project. As self supervised learning has proved its capabilities of learning the intricacies hidden in the data. It will play a key role in the foundation model.
The main requirements of our project to make a foundation model are as follows:
It is my research till now around this topic. I am open to any suggestions and discussions in the comments. I am looking forward to collaborate with amazing folks and to contribute to this project.
Mohan (mohan.kumar.min22@itbhu.ac.in)
Beta Was this translation helpful? Give feedback.
All reactions