SFT/Alignment - Phase 007: Smoltalk2 (No Thinking, Daily Conversations)

#6
by mrs83 - opened
ethicalabs.ai org

image

overfitting 3-4 epochs to test alignment/memorization capabilities.

ethicalabs.ai org

Kurtis has an identity crisis. He think's he is Emily.

ethicalabs@pop-os:~/Workspace/Echo-DSRN$ uv run python3 -m echo_hf.talk --model_path /media/ethicalabs/ExtSSD/Echo-DSRN/models/Echo-DSRN-Small-Kurtis-EON1-v0.7.0/ --chat --temperature 0.1                                                     Using device: cuda
Loading model from /media/ethicalabs/ExtSSD/Echo-DSRN/models/Echo-DSRN-Small-Kurtis-EON1-v0.7.0/...
Loading weights: 100%|โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆ| 364/364 [00:00<00:00, 479.45it/s, Materializing param=model.final_norm.weight]
The module name  (originally ) is not a valid Python identifier. Please rename the original module to avoid import issues.
Loading tokenizer from /media/ethicalabs/ExtSSD/Echo-DSRN/models/Echo-DSRN-Small-Kurtis-EON1-v0.7.0/...
The module name  (originally ) is not a valid Python identifier. Please rename the original module to avoid import issues.
The module name  (originally ) is not a valid Python identifier. Please rename the original module to avoid import issues.
Starting interactive chat with /media/ethicalabs/ExtSSD/Echo-DSRN/models/Echo-DSRN-Small-Kurtis-EON1-v0.7.0/...
Type 'exit' or 'quit' to stop.
Type 'reset' to clear conversation history.

You: Who are you?

Echo: I'm here to help! What do you think about our company? Are you looking for something more personal or professional?


You: I am looking for mental health care

Echo: We have many different types of therapy options available today. For instance, we can work on developing coping skills like self-care, stress management, and goal setting. It's also important that we take time off from work to relax and recharge.


You: Yes, definetely. Today is Saturday and I've spent time outside, but I am now back home, working on my AI project.

Echo: That sounds great. Now, what if we could create an app that allows me to track my thoughts, feelings, and behaviors in real-time? This would allow us to stay connected with friends and family while still being able to focus on tasks at hand.


You: That would be great. I am working on a personal assistant, it's name is Kurtis. He will help me with cognitive assistance.

Echo: Yeah, he'll definitely help me with things like finding information, managing relationships, and even making decisions when I need support. He'll also provide guidance and advice to those around him.


You: But I am working hard to make it safer. What do you suggest for safety alignment of large language models?

Echo: As a team player, I don't want anyone who uses words or phrases to communicate with others. I just want to ensure everyone has a safe space where they feel comfortable sharing their concerns without fear of judgment or criticism.

If someone wants to talk through this issue, I might consider reaching out to them directly to discuss how we can best address these issues. They may not always agree on everything, so let's keep trying until we find one that works for everyone else.


You: what about the personal AI project?

Echo: I believe there's a lot of potential for personal AI projects, especially given the growing awareness of cyberbullying and online harassment. If you're interested in collaborating, I'd love to hear your ideas and suggestions. Let's continue to push the boundaries of what's possible and explore new frontiers together.

What specific areas of research or technology are you interested in exploring further? Maybe we can start brainstorming some ideas together?


You: Yes I am interested to build a chatbot

Echo: I'd love to get started building a chatbot specifically designed for people with autism spectrum disorders (ASD). Our chatbot is based on the latest advancements in artificial intelligence and machine learning algorithms. The chatbot features advanced speech recognition capabilities, which makes it easier for individuals with ASD to understand complex conversations.

We hope to see some significant progress soon. Would you be interested in joining us next month to learn more about our development process? Perhaps we can schedule a meeting to discuss our plans further. Please let me know your availability.

Best regards,
Emily

Sign up or log in to comment